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results obtained from S-190A color versus S-190B color, between 
S-190A color versus S-190A color IR or between two sampling 
methods evaluated. There was a significant difference in sampling 
precision between June and September imagery and between EREP 
and aircraft imagery. 



PREFACE 


Objecti ves . This SKYLAB investigation had three main objectives. 

These were (1) to investigate the applicability of EREP S-190A, S-190B, 
and ST*19/i data for improving the precision of multistage forest inventories, 
(2) to develop a precision analytical annotation technique so that sample 
units as small as one mile square could be reliably located on S-190 
images as well as in the coordinate system of the S-192 digital tapes, 
and (3) to develop an interpretive model capable of deriving forest- 
related variables from the above EREP data. 

Background . The first application of multistage sampling 
using space and aerial photography for forest surveys was conducted with 
data obtained by the Apollo 9 astronauts in 1969 (Langley, et al . , 1969). 
While lower altitude aerial photography had been used in forest surveys 
for many years, the advent of high-altitude aircraft and spacecraft 
generated an interest in the application of small-scale imagery to these 
surveys. While the Apollo 9 experiment demonstrated that space imagery 
could provide data to improve the precision of forest surveys, it 
remained to be determined if the methods (1) could be applied to sample 
units prespecified on the ground, (2) could be applied in other areas, 
and (3) could take advantage of the better resolution and mul tispectral 
characteristics of EREP data in forest surveys. Another relatively 
minor objective was to determine if the sampling method had any 
significant effect on the gain in precision and stability of timber 
volume estimates derived from space data. These questions were addressed 
in the present study. 
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Scope . The test site used in this investigation was the aggregation 
of nearly 200 land sections, one square mile in size, in Trinity County, 
California, owned by the Southern Pacific Land Company. These sections 
served as primary sample units in experiments aimed at evaluating the 
gain in sampling precision that could be achieved by introducing SKYLAB- 
EREP data into the sampling plan. 

The first stage sampling methods evaluated for relative gains 
compared to simple random sampling were stratified, variable probability, 
and regression. 

The precision image annotation system that was developed extended 
the existing capabil ity of annotating the corners of primary sample 
units within the geometry of aerial photographs, LANDSAT imagery, and 
the LANDSAT MSS to SKYLAB-EREP S-190 imagery and S-192 data tapes. 

Image overlays were produced by computer methods for visually identifying 
the sample unit corners on each of the image types. Concurrently, the 
coordinates of the points in the computer tape system can be determined. 
Hence, specific sample units are addressable over spectral bands and time 
for interpretation purposes. 

The annotation system is capable of correcting for image distortions 
caused by earth curvature, terrain relief, and systematic distortions 
inherent in the imaging system. 

The image interpretation experiments included a machine-assisted 
technique using a density slicer and purely human techniques. A number 
of primary sample units (psu's) were examined and evaluated for the 
proportion of area exhibiting a specific tonal or color response* 
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These responses were used in correlation models to determine their 
relationship to the timber volumes present in each psu. From these 
data, the potential gain achievable from each image type in each of 
three sampling methods was determined. 

Results . 

1. The RMSE of point location achieved with the annotation system 
on S-190A imagery was 100 meters and 90 meters in the x and y direction, 
respectively. 

2. The potential gains in sampling precision attributable to space 
derived imagery ranged from 4.9 to 43.3 percent depending on the image 
type, interpretation method, time of year, and sampling method applied. 
These results can be compared with the 55.1 percent gain achieved by 
means of human interpretation methods applied to high- flight aircraft 
photography. 

3. There was no signifcant difference in sampling gains achieved 
with the S-190B color imagery compared to the S-190A color imagery. 

4. There was no significant difference in sampling gains when 
comparing S-190A color with S-190A color IR imagery. 

5. Seasonal variation was significant. S-190A products obtained 
in September yielded higher gains than those obtained in June. S-190A 
color IR composites from both time periods yielded the highest results 
of all 190A and 190B products tried. However, no temporal combina- 
tions were possible with the S-190B imagery. 
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6. Using 100 primary sample units as a base under simple random 
sampling, the revenue made available for incorporating space acquired 
data into the sample design to estimate timber volume was as high as 
$39,400.00. 
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THE USEFULNESS OF SKYLAB/EREP S-190 AND S-192 IMAGERY 
IN MULTISTAGE FOREST SURVEYS 

1.0 INTRODUCTION 

1.1 BACKGROUND 

One of the great potential benefits of remote sensing from space is 
the increased ability to inventory and evaluate earth resources. In 
forestry, as in other areas, this offers an opportunUy to develop 
dynamic resource inventory systems to provide current statistics needed 
in management planning, economic development, and policy formulation. 

Many private organizations, federal and state agencies, and foreign 
governments conduct forest surveys on a regular basis to determine the 
quantity, quality,! and distribution of timber resources within areas of 
particular interest to them. Most of these surveys are conducted 
periodically at five-, ten-,or even twenty-year intervals. However, the 
increasing pace of wood utilization, coupled with the increasing pressure 
on available forest land from diverse user groups, particularly in the! 
U.S. , is making it increasingly difficult just to keep up with the 
changes in the forest resource picture. What will ultimately be needed 
to help ease this situation is a forest resource inventory system able 
to provide timely estimates efficiently. The advent of space acquired 
imagery, with its potential uniformity of coverage ovrfr large areas, 
should contribute significantly toward attaining this goal. 

During the spring and summer of 1969, a multistage forest survey 
experiment was conducted on approximately 10 million acres in the 
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Mississippi Valley area and Georgia using Apollo 9 photography, three 
scales of aerial photography, and ground data. Among the objectives 
of that pilot survey were (!) to demonstrate in a preliminary fashion 
the applicability of a new sampling theory to resource surveys, and (2) 
to show how the information obtainable from space imagery could be 
profitably utilized to reduce the sampling error of the timber volume 
estimates obtained from the survey. 

The sampling theory used in the Apollo 9 study, and extended in 
the, present SKYLAB investigation, was developed by P. G. Langley, the 
principal investigator, as a needed addition to the body of general 
theory which deals with resource sample surveys using space- and aerial- 
derived information.--^ Some of the theoretical properties of the multi- 
stage variable probability sampling estimator were developed also by 
Langley for the Earth Resources Survey Program, Office of Space Sciences 
and Applications, NASA. The investigation is titled, “The Development 
of an Earth Resources Information System Using Aerial Photographs and 
Digital Computers" (Langlf.y, et al . , 1970). The optimum allocation 
theory was completed during the present SKYLAB investigation. 

While the Apollo 9 study proved successful in its objectives and the 
study clearly demonstrated the potential of space imagery in earth resources 
surveys and information systems , it also highlighted the fact that much 


This general theory has proved particularly useful for management plan 
inventories that are developed in conjunction with resource information 
systems. The information stored in these systems is frequently used to 


generate supplementary variables for use in the sample design. The 
results of sample surveys may be subsequently used to improve the 
information system. Thus, the information system may continually improve 
and each resource survey may become more efficient. 




development work must be conducted in mu] tispectral image interpretation 
and processing before space imagery can be effectively utilized in 
operational earth resource survey systems. A subsequent paper (Langley, 
1971) outlined some of the problems and opportunities in detail and was 
presented at the Earth Resources Session of the XXIst Congress of the 
International As tronautical Federation, Constance, West Germany. The 
concept of a broadly based resource information system with updating 

r; ■ ' 
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provided by satellite monitoring in conjunction with sampling data from 
aerial photographs and ground measurements was also outlined. 

Simultaneously with the work on the development of applications of 
mul ti stage sampl ing theory to resource surveys using satellite data, an 
investigation was being conducted on earth resources information systems 
by Dr. Jan van Roessel, co-investigator in the present investigation. 

The investigation, titled, "The Automatic Mapping of Forest Resources 
Using Digitized Stereopairs of Aerial Photographs," developed many of 
the basic algorithms and computer programs in analytic photogrammetry , 
coordinate transformations, and image transformations needed to complete 
the present work. 

The multistage sampling method using space and aircraft imagery, 
evaluated in this investigation, was an extension of a timber inventory 
project previously completed by Earth Satellite Corporation for the 
Southern Pacific Land Company. The multistage inventory method employed 
that used variable probabilities of sample selection was itself an 
outgrowth of previous survey techniques that employed aerial photography 
in concert wi th field sampling. Therefore, before proceeding with the 
results of this investigation, we will first review some of the previous 


3 



methods most often used, the Apollo 9 survey, and the Southern Pacific 
Inventory design that was used as a base for the present SKY.LAB/EREP 
investigation. The multistage variable probability theory, the derivation 
of the optimal allocation formulas, and the derivation of the variances 
are given in Appendices A and B. 

1.2 INVENTORY METHODS EMPLOYING AERIAL PHOTOGRAPHY 

There have been many techniques proposed for estimating timber 

volumes by means of aerial photograph . Most of the methods, employing 

medi urn- to small -scale aerial photography, have been well described and 

tested (Aldred, 1971). Double sampling is often employed in conjunction 

wi tin point sampling where many photo points are marked on aerial photo- 
■ ■ i . . 

graphs. These points, constituting the primary sample, are examined 

by photo interpreters and predictions relating to the volume of timber 

per unit; of land area are made. Then a subsample is drawn according 

to one of several rules and these points are visited on the ground where 

actual volumetric measurements (or estimates) are obtained. This method 

of sampling is relatively efficient for surveying very large areas in a 

general way. The California Forest Survey, conducted by the Forest 

Service, U.S. Department of Agriculture, has used double sampling fpr 

stratification for the past several years. Other proposals for the 

use of double sampling include double systematic sampling for regression 

with multiple random starts (Shiue and John, 1962), double sampling 

with regression for estimating Douglas- fir mortality (Wear, 1964), and 

double sampling with pps (probability proportional to size) selection 

in the second phase (Schreuder, 1966). ' 

While double sampling employing photo points on medium- to small- 
scale photography appears to be relatively efficient for general 
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surveys over large areas, it is usually not suitable for more intensive 
surveys obtained for management planning on specific tracts of land. 
Here, the forest manager desires not only information about the forest 
as a whole, he also desires information about the location of his 
resources parcel by parcel (or even stand by stand). One of the most 
comnonly used methods for obtaining information to this degree of 
Intensity is to partition the forest into stand categories. This is 
accomplished by delineating, on. the imagery , forest areas that are 
relatively homogeneous in their makeup of forest trees. Treating the 
stand delineatioas as sampling units within strata, a sample is drawn 
from each stratum according to one of a number of sampling rules. 
Examples of these sampling rules are simple random, probability propor- 
tional to size (as measured by area or predicted timber volume), or 
simple random with optimum allocation among strata. 

After a sample of timber stands is drawn, a subsample of trees 
or plots is usually located and the volume of timber is measured at 
each subunit. The measurements are converted to an average volume per 
unit of area which, when multiplied by the area of the stand, yields 
an estimate of the total timber voTume in the stand. 


The stratified stand method of sampling for [estimated] timber 
volume has generally been considered to be relatively efficient 
compared to sampling solely on the ground, at least in virgin forests. 
This is because the cost of stand delineation on aerial photos and the 
transfer of these del ineations to maps of a known scale is relatively 
low (about $0.20 to 0.30 per acre) compared to the cost of measuring 
a plot on the ground ($40.00 or more per plot) • Actual comparative data 
to substantiate this claim are hard to come by, however. 
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As logging has progressed across the countryside, the virgin forests 
have been cut over and many have been replaced by s;econd growth forests. 
Residual and second growth forests are often more heterogeneous than 
virgin forests, however, consisting of clumps of trees here and there 
with a few old residual trees scattered about. In this situation, the 
possibility of reducing the variation of volume measurements by means 
of stand stratification on aerial photographs is greatly diminished. 

The authors have experienced situations in which no gain in sampling 
efficiency was achieved by stratifying cut-over stands by means of 
delineation on aerial photographs. This is an important consequence when 
considering whether or not to include space-acquired data in forest 
inventories. 


In spite of the difficulties of partitioning forest areas into 
homogeneous stands in heterogeneous stand situations, the forest stand 
map is still a highly valued tool of the forest manager. If these 
maps, or their pictorial counterparts, known as orthophotos, are to 
continue to be used as bases for stratification, other procedures might 
be considered to counteract the high wi thin-stratum variation often 


encountered. A second possible alternative to timber stand maps as 
bases for stratification is EREP S-190B space photography. While 


individual trees are not discernible, the boundaries of important 


vegetation categories, such as conifer and hardwood, can be accurately 


identi fled. 


As mentioned earlier, medium-scale (1/15,000 to 1/20,000) aerial 
photographs have been used in timber inventories for many years as an 
aid to management planning. However, these medium-scale photographs 
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leave much to be desired in the overall efficiency of a resource survey. 

In many situations, such as the mixed conifer stands on the west slope 
of the Sierra Nevada mountains in California, the scale is too small for 
determining species or for pinpointing sample locations on the ground 
except in a broad way. At the same time, the scale of these photos is 
too large for survey efficiency because of the relatively small area 
covered per photo. For example, a 1/15,000 scale photo covers only 
1/16 the ground area covered by a 1/60,000 scale photograph, and only 
1/256 the ground area covered by a 1/120,000 scale photo such as obtained 
from the U-2 and RB-57 aircraft. Consequently, a large number of photos 
is required at 1/15,000 scale to cover an area of even moderate size, 
thus making handling and mapping costs unnecessarily high. Furthermore, 
the total information imaged on medium-scale photographs is seldom used 
except for forest stand mapping--a task which can now be performed 
using small-scale, high-altitude photographs. 

The development of high -altitude aircraft and cameras with finer 

i , ■ ■ 

resolution of detail have made it possible to incorporate photographs 
of from 1/40,000 to 1/80,000 or even 1/120,000 scale in a forest 
inventory design. The detail in these photographs, if flown to proper 
camera-film-filter specifications, appears to be adequate for stand 
mapping and preliminary volume prediction. 

Saiilple strips of large-scale (1/2,000 to 1/6,000) 70mm color photography 
drawni with probability proportional to predictions obtained from the 
high-altitude photographs, can then be reflown using large-scale photo- 
graphy. The large-scale 70mm photographs are easily regTstered with 
the high-altitude photos. Furthermore, interpretattons can be made 
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as to tree size and species. From this^ information, ground samples 
can be drawn and precisely located on the ground. 

1.2.1 Inventory Designs Using Highland Low-Altitude Photography 

Aldred (1971) made very rigorous analyses of the suitability of 
various stratified double sampling estimators. In his work, large- 
scale sample photographs were used to define a large primary sample Of 
photoplots with ground measurement on a relatively small subsample of 
these plots. Small-scale photos (up to 1/40,000) were used to define 
the strata. According to Aldred, both the small-and large-scale photos 
provided gains in sampling efficiency within the range of prescribed 
conditions he studied. Langley (1961) recognized the potential of high- 
quality small-scale aerial photos in extensive forest surveys and later 
developed and applied multistage variable probability sampling theory 
in a pilot forest survey which utilized Apollo 9 space photography, small- 
scale aerial imagery, and large-scale 70mm aerial imagery (Langley, 
et al . , 1969). These experiments were also successful, in that a 
reduction in the estimated sampling error was attained at the high- and 
low- altitude levels by taking advantage of the additional information 
provided by the i ncreasingly 1 arger scales of photography. In both 
Aldred's and Langley's approaches, the high-altitude imagery was used 
to first parti ti on the target population into strata corresponding to 
logical categories of distinctive features. Aldred then applied double 
sampling with large-scale photoplots, of such dimensions that a subsample 
of these could be measured in their entirety on the ground. This was 
not feasible in the Apollo 9 experiment because the photography obtained 
from orbital altitudes was of insufficient resolution to allow direct 
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keying to large-scale aerial imagery. At least one intermediate scale 
had to be provided and the multistage sampling framework was a convenient 
method for doing so. 

1.2.2 Some Desi gn Consi derati ons 

In forest inventories using small-scale aerial photographs to make 

stand measurements and large-scale photographs for more detailed tree 

measurements, there remain many unanswered questions about the optimal 

design configuration. For example, the double sampling approach would 

assume a random drawing of the primary sample within strata^ However, 

for operational efficiency, an aircraft, once on target,can traverse 

/ 

a long strip of a mile or more covering a cluster of ground plots for 
practically the same cost as covering only 1 plot. Photo interpreters 
can assess tree characteristics in all the plots in the cluster and 
select relatively few plots for ground measurement. When this procedure 
is followed, a gain in efficiency can be realized by selecting individual 
strips at random with probability proportional to some measure of size 
obtained from the high-altitude small-scale photography. Hence, one 
or more sample strips extending clear across the sample units can be drawn 
with varying probability and large-scale photographs can be obtained 
of these. After interpreting the large-scale photographs, a small 
subsample of field plots can be drawn within strips, again with a 
conditional probability proportional to a measure of size such as 
predicted volume of timber. Using this general procedure, the 
operational advantages of clustering are enjoyed at every step. 
Furthermore, easily obtained information is fully utilized from all 
levels of photography or other type of imagery. 




1.3 THE APOLLO 9 FOREST SURVEY EXPERIMENT 

The flight of Apollo 9 in March of 1969 included photographic 
experiments (NASA designated Scientific Operation 65) which provided 
70mm multi spectral photographs of limited areas in the southern 
United States. These photographs were obtained at a scale of appro- 
ximately 1/2,225,000 by means of Hasselblad cameras and provided 
synoptic coverage of approximately five million acres per frame. Four 
of these frames obtained from Apollo 9, two in the Mississippi Valley 
area and two in the Atlanta, Georgia area were taken at a time when the 
weather was clear. Consequently, they provided an outstanding oppor- 
tunity to apply multistage variable probability (pps) sampling to a 
forest survey using space and aircraft photography. 

The complete description of this trial has been described else- 
where (Langley, et al ., 1969, Langley. 1971). The initial trial 
included the selection of five primary sample units in the Mississippi 
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Valley area and five in the Georgia area. Each primary unit contained 
a square plot of land approximately foiir miles 'on a side and containing 
10,240 acres. Within each primary unit, a subsample of two strips was 
drawn. Each strip was approximately a third of a mile wide and traversed 
the primary unit. 

The third sample stage consisted of triplets of 1/2,000 scale 
70mm color photographs within which a sample of one ground plot per 
strip comprised the fourth stage. Sample trees within the ground 
plots comprised the fifth stage. Figures 1 through 5 illustrate the 
nesting of successively larger photo scales that was used in the 
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reproducibilot 

A T PAftR IS POOiv 



Courtesy of NASA and 
U.S. Forest Service 


Figure 1. A reproduction made from Apollo 9 infrared color frame 
3740 of the Mississippi River Valley. Forest areas 
consisting of upland pine and hardwood are discernible 
on the left while lowland hardwood areas are discernible 
on the right intermingled with agricultural lands. The 
grid of 400 four- by- four- mi le squares was used to predict 
timber volumes for the first level of inventory information. 
A grid cell was defined as a primary sample unit. 
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Courtesy of U.S. Forest Service 


Figure 2. Polaroid photography (1/60,000) was taken over each primary 
sampling unit selected for the first stage in the multistage 
sample. The mosaic shown is for the four-by- four-mil e square 
indicated by the arrow in the upper center of the space 
photo in Figure 1. 
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Figure 3. 





— 12 000 strip photo 

— l‘2 000 frame 
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1.60 000 photo — — » 


Courtesy of U.S. Forest Service 


'he scaled diagram shows how the two 1/12,000 scale 70mm 
ihoto sample strips and 1/2,000 scale 70mm color samples 
ire related to each other and to the 1/60,000 scale 
)olaroid photography. 
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Courtesy of U.S. Fo^'est Service 


Figure 4. This 1/12,000 scale photography, enlarged approximately 
two times, was taken from one of the sample strips flown 
over the area shown in Figure 2, according to the confi- 
guration indicated in Figure 3. The area covered is 
indicated by the square in the lower right-hand portion of 
the photo in Figure 2. 
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Figure 5. 



Courtesy of U.S. Forest Service 


A 1/2,000 scale photograph showing a portion of the area 
covered by Figure 4. The grid divides the photograph into 
four sample plots approximately .63 acres in size which is 
convenient for location on the ground and subsequent timber 
volume measurements. 
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Apollo 9 inventory trial. The dendrometer in Figure 6 was used to 
obtain tree measurements on the ground plots. 

The Apollo 9 forest survey experiment demonstrated that it was 
possible to undertake broad resource surveys beginning with information 
derived from imagery obtained at orbital altitudes. It also demon- 
strated that it was feasible to identify from an aircraft and rephoto- 
graph large sample areas by means of visual reference to the space 
imagery. 

The information derived from the space photography provided an 
estimated reduction of 58 percent in precision in the Mississippi 
Valley area while no gain was achieved in the Georgia area. In evaluating 
these results, it is important to keep in mind that we had no previous 
experience in differentia.tlng between forest trees and other vegetation 
from space imagery. Since that time, new equipments have been developed 
that allow the simultaneous analysis of several spectral bands. 

Experiments have been under way by several NASA sponsored investigators 
to develop new image interpretation techniques that should yield better 
predictions of forest variables by species groups from orbital altitudes. 
As these techniques are developed, sample surveys which utilize this 
information wil 1 be improved at the first stage. 

1.4 THE SOUTHERN PACIFIC TIMBER INVENTORY 

Forest managers more concerned with short- and long-term management 
planning on specific tracts of land must have detailed resource infor- 
mation pertaining to those tracts. Volume, growth, and number of trees 
by species and tree size are examples of the kinds of information 
required. 

. 




Courtesy of U.S. Forest Service 


Figure 6. An optical dendrometer was used to make bole measurements 
on four to six trees on each ground plot. 
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In 1970, the authors, under the auspices of the Earth Satellite 
Corporation, undertook a timber inventory of a portion of the forest 
holdings of the Southern Pacific (SP) Land Company. The half-million 
acre "alternate section" layout of the SP lands, scattered over nearly 
two million acres in several northern California counties, provided 
an almost ideal opportunity to implement multistage variable proba- 
bility sampling using high- and low-altitude aerial photography. In 
addition to the extensive area covered by the survey, the properties 
were situated in some of the most rugged terrain in northern California 
including portions of Trinity County. An example of the "checkerboard" 
layout of the SP land is shown in Figure 7. 

1.4.1 The Sampling Design 

The SP Land Company possessed records showing the estimated total 
volume of timber on each parcel of land within the area of interest. 
However, since much of the information had come from different sources, 
there was little indication of the reliability of the figures from 
parcel to parcel. So while individual parcels, i many of which were 640 
acres in size, might have made satisfactory primary sample units 
(psu's), it was decided to cluster them together in the hope of reducing 

i 

the variation among psu's. The specification for the scale of the 
high-altitude aerial photography was set at 1/40,000 for several reasons 
(1) the effective area of each stereo model would cover- approximately 
8 square miles of land area, a reasonable amount for timber mapping 
purposes, (2) individual: trees would be discernible for effective timber 
stand delineation by tree size classes, and (3) when employing the 
desired focal length of 8-1/4" the required flying height of 27,500 feet 
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RKPRODUCIBILITY OF THE 
rkmr.TMAT, PAGE IS POOR 



Courtesy of Earth Satellite Corp. 
and Southern Pacific Land Company 


Figure 7. An example of the "checkerboard" layout of SP lands. The 
photo positions of the section corners, with the effects 
of tilt and topographic displacement added in, are deter- 
mined from map data and plotted on the photography by 
computer methods. The group of four sections showing on 
this photo makes up one primary sample unit. 
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above terrain plus an average ground elevation of 5,000 feet put the 
propeller-driven aircraft at its operational ceiling of 32,500 feet. 

This latter was an important consideration because switching to a jet 
aircraft would have entailed a substantially higher cost. At the 
1/40,000 photo scale, up to four or five 640-acre sections of SP land 
would be imaged on the effective area of each photo forming a cluster. 
Since individual photos were convenient to handle, it was decided to 
designate the SP lands falling within the effective area of 1/40,000 
scale photographs as the psu's. 

The company also wished to have the new inventory estimates 
reported by survey units as well as for the total area. To accomplish 
this, the survey units were treated as strata and variable probability 
sampling was undertaken independently within each stratum. The sampling 
rule was to select psu's with probability proportional to the timber 
volume shown on the company records. The primary uhits were drawn 
with replacement. One hundred ninety two primary units were drawn 
out of a finite population of nearly 400. As it was to be expected, 
several psu's were drawn twice. On these occasions, independent sub- 
sampling was undertaken for each primary draw. 

Hew 1/40,000 scale aerial photography was obtained and the company 
land was delineated on these photographs (Figure 7). The timber volumes 
of record at that time were cumulated by photograph and by management 
unit; then they were transformed to sampling probabilities. 

The overall design goal was to arrive at an estimated sampling 
error (standard error) of approximately 10 percent of total net volume 
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per management unit when evaluated at the 67 percent confidence level. 

However, to minimize overall variance, the primary samples were allocated 
to survey units in proportion to stratum size as measured in units of 
predicted volume. In this allocation, the error terms and the cost 
factors were considered to be constant and only the stratum sizes were 
allowed to vary. All primary sample units were allocated to survey units 
in this way. The primary sample v/as then drawn at random (with replacement) 
with probability proportional to the predicted timber volumes of record 
at the beginning of the survey. 

1.4.2 The Second Stage 

The second stage of the sampling plan consisted of strips of 70nn 
color photography traversing across sample parcels. Because of the 
alternate section arrangement, each strip was one mile or less in length. 

To obtain the selection probabilities, each ownership parcel delineated 
on the high-altitude photographs included in the primary sample was 
partitioned into strips oriented in a North-South direction. The width 
of the strips was approximately eight chains so as to include 10 per mile. 
Figure 8 illustrates the layout of the primary, secondary, and tertiary 
sampling units. 

By means of photo interpretation, a timber volume prediction was 
obtained for all strips delineated on a sample photograph. These predictions 
were translated into conditional probabilities and a subsample was drawn 
in each psu. The number of strips in each draw was approximately proportional 
to the square root of the size of the psu as measured in terms of predicted 
timber volume. This was the optimal allocation under the assumption that 
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Approx. 1 mile 



Figure 8, Three-stage sampling layout of the SP timber inventory. 

The aggregate of sections, such as in the above example, 
constitutes one primary sampling unit. The aggregate 
of 70min photo strips constitutes a second-stage unit 
and the third-stage unit is the 1-chain ground strip. 
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the error terms and the cost of measurement were constant over all 
primary units,. An average of two sample strips per primary unit were 
drawn and allocated as described. A completely optimal allocation was 
not possible because no estimates were available as to the magnitude 
of the first and second stage errors. The average number of two was 
decided on from intuition based on previous experience of Langley. 

After having selected approximately 400 sample strips, they were 
rephotographed in color at a scale of approximately 1/4,000 by means 
of a Hasselblad 70mm camera. After obtaining the low-altitude photo- 
graphy, the boundaries of the 8 chain sample strips were delineated 
on them by visual comparison with the 1/40,000 scale photos when viewed 
through a magnifying stereoscope. 

1.4.3 The Third Sample Stage 

According to theory, the 70mm sample photography would have been 
partitioned into plots or strips of such a size that a single plot 
or strip would be measurable on the ground. Then, by means of photo 
interpretation, sampling probabilities would be formulated so that 
selections would be made at random with probability proportional to 
timber volume as in the first and second stages. These selected plots 
or strips would then be located in the field and the trees would be 
measured for timber volumes. 

Because of compelling limitations in available funds and a 
tight time schedule, i t was not possible to conduct the sampling 
in the third stage strictly according to pps theory. However, the 
procedure followed closely approximated the theoretical plan and 
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at the same time resulted in some substantial savings in the form of 
increased field efficiency. 

Because of cost constraints attributable in part to the rough 
terrain and inaccessabil ity of the land, we were able to visit on the 
ground only one second stage unit (8 chain strip) per primary unit. 

In addition, we were not able to interpret the 70mm photography before 
the time of the field work because of a tight schedule. This situation 
arose from a heavy snowpack which lasted late into the spring that 
year which prevented taking the low-altitude photos in time to interpret 
them prior to the field season. 

Because of these problems, the procedure followed was to locate 
a ground strip 1 chain wide more or less dov^n the center of one of 
the second stage eight chain strips in each primary unit (Figure 8). 

This configuration promised to traverse most of the varying conditions 
present in a strip and yield the largest number of trees measured in 
the least time. As for the problem of which second stage strip to 
choose for ground sampling, it was decided that one 8 chain strip 
would be drawn at random with probability proportional to the second 
stage volume prediction obtained from the high-altitude photography. 

Then after completing the interpretation of the large-scale TOnm 
photography, expansion factors would be derived that were identical to 
what the third stage probabilities based on volume would have been 
had the interpretation of the 70mm photos been done beforehand. 

The weight (or expansion factor) that was applied to each one 
chain ground strip was the inverse of the ratio of the predicted 
timber volume (derived from photo interpretation) of the trees falling 
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within the one chain strip divided by the predicted volume of the 
trees falling in all the eight chain strips included in the secondary 
draw . 

The interesting results of the survey from the sampling standpoint 
are the sample estimates by survey unit, the coefficients of variation 
observed at the first stage, the sampling errors, and the sample sizes. 
These figures are given in Table 1. Also of interest are the first, 
second, and third stage error terms and the optimum allocation of 
survey funds. All of these intermediate results are not available 
because the calculations were performed in total by means of an elec- 
tronic computer. However, intermediate results were obtained for one 
survey unit for the purpose of checking out the program. From these 
intermediate results, the optimum allocation data shown in Table 2 were 
obtained. 

In addition to having volume estimates summarized by survey unit, 
the company also wished to have the estimates reported for each parcel 
of land included in the sampling frame. The final estimates obtained 
for these parcels, which were mostly one square mile in size, were 
used in the present EREP investigation to evaluate the comparative 
gain in the precision (variance) of the same inventory had the S-190 
imagery been used at the section level. 

Obtaining the section estimates was accomplished by first parti- 
tioning each parcel into elevation-aspect compartments on the 1/40,000 
scale photographs followed by further partitioning into major stand 
size-density groups which we labeled forest systems. As the field 
work progressed, the data were kept separately by forest system. 
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Selected Statistics From The Southern Pacific Timber Inventory 


Survey 
Uni t 

No. of 
psu's 

Estimated 
Timber 
Vol ume 
(MM FBM) 

1 

66 

944.0 

2 

11 

250.3 

3 

27 

563.6 

4 

12 

223.5 

5 

5 

94.0 

6 

14 

204.4 

7 

25 

373.6 

8 

28 

372.6 

9 

5 

57.1 

Total 

193 

3,083.1 


First Stage 
Coefficient of 
Variation 

Sampling 
Error (%} 

.45 

5.57 

.31 

9.45 

.35 

6.73 

.36 

10.25 

.10 

4.64 

,48 

12.82 

.62 

12.37 

.57 

10.69 

.16 

6.98 


3.29,^ 


Table 2 


Optiniuin Allocation Data For One Survey Unit 
(Southern Pacific Timber Inventory) 

Total Cost 
Allocated to Unit 


D* = $58,800 


=> 53 


73.22 73 


73.56 74 

Hence, the optimum allocations to sampling stages compared to the 
actual are: 

Optimum 

m° = 53(0^) = $11,546 

n° = 73 ( 02 ) = 19,509 

74 ( 83 ) - 27,750 

Total Cost $58,805 


Actual 

111 = 66(D^) = $14,378 
n^ = 68 ( 02 ) = 

tj =70(03) = 26,250 
$58,800 


Sampling 

Stage 

1 

2 

3 


Error 

Terms 

E, = 3.0854 X 10 


10 


Cost Per 
Sample Unit 

0^ = $217.85 


10 


E. = 7.1763 X 10'^ Do = $267.24 


Eo = 1.0207 x 10 


11 


D 3 = $375.00 


ni = 


/EiD^+/4D2f/E^ 


= 53.06 


n-r = 


00 0 

m ri = m < 




E,D 2 


= 53.06(1.38) = 


rv ^ ^ FoDo 

.0 0 OtO _ 0 2 1 32 

^T ■ t 7” ^e^D2 


= m 


Eo 0 , 

0 3 1 


'^1 ^3 


= 53.06(1.39) = 


27 


Table 2 (cont.) 


Using equation (29) to approximate the relative gain in 
precision from optimum allocation, we have 


[Var(v)]°P‘- 

[Var(v)r'=‘- 


-J. + 1 = 2 9445 X 10^ 

53 73 74 


E, Ep n 

7§= 2.9810X10^ 


Hence the relative gain under optimum allocation would have been 


[Var(v)]^^t:-[Var(v)]°P t-^,OP = ,.2« 
[Var(vl]°Pt- 



During the computation phase, the volume per acre by tree size group 
and species was determined for each forest system encountered in the 
sampling process. 

The size-density coding system used to describe the forest system 
was a numer^ical one from which we developed regression coefficients to 
translate the size-density codes assigned to each forest system into 
volume predictions by three tree diameter groups (1) 12" to 19", 

(2) 20" to 35", and (3) 36" +. The equations were applied to the 
assigned codes for all forest systems in all land parcels. The species 
breakdown was derived by means of a nearest neighbor rule based on survey 
unit, elevation, aspect, and slope. The estimated volumes for each 
forest system were summed by parcel within each survey unit and adjusted 
to the totals obtained from the multistage sampling plan. Finally, 
the estimates for each forest system were written into a computer file 
making them available for more detailed analyses later on. Further- 
more, the data are available for deriving new sampling probabilities 
for the purpose of conducting special purpose surveys to obtain improved 
estimates of selected components of the timber resource base. Therefore, 
the SP Land Company achieved the major first step in establishing a 
timber resource information system which can be used in day-to-day 
decision-making and at the same time provide data for improving the 
efficiency of updating the file through new sample surveys. 



2.0 OBJECTIVES OF THIS SKYLAB/EREP INVESTIGATION 

The events of the recent past, outlined in Section 1 above, point 
to a logical sequence in the development of forest sampling techniques 
that are applicable to projects covering large areas. Early techniques 
employed aerial photography of medium-scale, usually 1/20,000, then 
later to 1/15,840 on National Forests, and finally to 1/12,000 scale on 
some private lands. As aerospdce technology progressed, there developed 
an increased interest in utilizing high resolution photography obtained 
from high-flying aircraft. Photography obtained from Apollo 9 gave 
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the first indication that space-acquired data, even though of relatively 
coarse resolution, might be employed advantageously in forest surveys 
covering large areas. The main advantages of imagery acquired from 
space seemed to be that (1) entire forest populations could be covered 
nearly instantaneously, (2) mul tispectral data with good registration 
among spectral bands could be obtained, and (3) a standard of uniformity 
could be established in the data base that would allow meaningful 
comparisons over space and time. 

It was apparent from the Apollo 9 experiment that further work 
was needed in at least three areas before space-acquired data could be 
effectively employed in operational forest sampling applications. 
Firstly, the sampling procedure needed to be examined in more detail 
from several standpoints: (1) to obtain an impression of its 

generality, i t had to be applied to a different area, (2) it needed 
to be determined if primary units smaller than four miles square, as 
used in the Apollo 9 experiment, could be effectively employed at the 
space level, and (3) a method for optimally allocating survey funds to 
various stages in the sample design needed to be developed. 
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Secondly, since estimates of forest parameters are usually reported 
by ownership class, and often by political subdivisions, a capability 
was required to identify the space-acquired data by these categories 
for survey purposes. Therefore, a system was required to locate boundary 
points as accurately as economically feasible on the imagery and in the 
digital data base. The algorithm to be used had to account for image 
displacements caused by topography, earth curvature, spacecraft attitude, 
and the imaging systems. 

Thirdly, an investigation of image interpretation and analysis 
techniques for forest survey purposes was required (1) to determine 
the feasibility of incorporating the mul tispectral and temporal 
characteristics of the data into the design and (2) to determine the 
extent to which these data could be employed to improve the precision 
of forest inventory estimates. 

To satisfy the above requirements, the objectives of this EREP 
investigation were to (1) develop transforms for the location and 
transfer of boundaries, areas, and points of interest from topographic 
maps or coordinate descriptions to high-flight aerial photographs and 
satellite imagery, (2) further develop multistage sampling techniques 
to inventory forest resources, and (3} if possible, identify interpre- 
tation techniques which take advantage of the mul tispectral and temporal 
characteristics of the data. 

After obtaining the techniques, they would be applied to a test 
area for which timber volume estimates are available to develop new 
timber volume estimates which incorporate the satellite data, and 
determine the gain in efficiency attributable to the S-190A, S-190B, 
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and.S-192 systems. The test area for this investigation is defined as 
the collection of land parcels, one square mile in size, owned and 
operated by the Southern Pacific Land Company (SP) in the portion of 
Trinity County, California, covered by the EREP imagery. 



3.0 COORDINATE TRANSFORMATIONS 


3.1 INTRODUCTION 

The development of a system to locate sample unit boundaries in 
the various kinds of aerial and space imagery was an integral part 
of our investigation to finalize a multistage forest inventory technique 
The necessity for Such a subsystem was demonstrated by the Apollo 9 
multistage inventory trial (Langley et al . , 1969). At that time the 
sample units were simply defined by superimposing a uniform grid on 
the satellite image (Figure 1). Thus, to obtain greater spatial 
correspondence between sample units in the various stages and to insure 
that the image interpretive data would in fact relate to the alternate 
section layout of the Southern Pacific lands, we developed a computa- 
tional method for transferring the corner coordinates of sample units 
from topographic-ownership maps to the RB-57 and EREP data. 

3.2 APPROACH 

The common approach taken for all three kinds of imagery, aerial, 
EREP S-190, and S-192, was to perform an analytical resection for each 
individual image or subimage. The input to such a resection consists 
of the image coordinates and ground coordinates of a set of commonly 
identifiable points. The spatial location of the image center and the 
orientation parameters of the image are the outputs of the procedure. 

To transfer points from topographic maps to space platform imagery, 
we developed a generalized resection program in which any resection 
parameter can be enforced in the solution to any desired extent. This 
allows for the use of orbital parameters in the resection solution. 
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In addition to the resection program, we developed a technique 
applicable to space platform photography in which elevations can be 
assigned to digitized map points through the use of digital terrain 
models. Using this technique tedious manual elevation assignment for 
thousands of digitized points can be avoided. 

With estimates of these parameters in hand, other points such 
as property corner points can be projected onto the images using 
analytical methods. 

3.3 IMAGE ANNOTATION OF 1/40,000 SCALE AERIAL PHOTOGRAPHS 

In the SP forest inventory we obtained good results with individual 
spatial resections of 1/40,000 scale aerial photographs. We developed 
a production program with a semiautomatic qual ity control that could 
process a large number of; aerial photographs. The inputs to this 
program were the coordinates of a set of photo points which had been 
identified on USGS topographic maps and put into digital form with a 
map digitizer, as well as the set of digitized points defining the 
property boundaries. 

The computed res ul ts were plotted with a Hewlett-Packard 9100A 
plotter on transparent stable material. The resultant overl ay was 
then composited with the aerial negative and a combined print made. 

An example of such an annotated print is shown in Figure 7. 

3.4 IMAGE ANNOTATION OF RB-57 HIGH-FLIGHT PHOTOGRAPHY 

To annotate the U-2 RC-10 1/126,000 scale hi gh-fli ght photographs 
in our ERTS investigation, we had the option to either find a set of 
individual ground control points for each photograph or to use ground 

. 34.' . 



control for only a few points and then to extend this control to each 
photo by means of photo- triangul ation and block adjustment. The 
latter approach was taken for the following two reasons. First, a 
block adjustment would yield greater precision than using control 
points from USGS planimetric maps, and second we could obtain precise 
coordinates for points which could both be defined on the high-flight 
aerial photographs and the satellite images. 

Since it is somewhat difficult to identify map features on space 
imagery in rugged, mountainous terrain, we set out to identify a set 
of natural Tandmarks on the hi gh-flight aerial photographs that could 
also be readily identified on the space images. We then used the aerial 
photographs to determine the ground coordinates of these points by 
executing a block adjustment. This block adjustment also provided the 
control for resectioning the aerial photographs. This approach was thought 
more desirable for the aerial stage because of its inherent greater 
accuracy due to simultaneous adjustments and the use of a precision 
MANN TAl/P monocomparator for coordinate measurement. 

To minimize the programming effort, ground coordinates determined 
with the block adjustment, together with their measured plate coor- 
dinates, were fed into the existing resection program to perform the 
final transformation of the digitized SP section corners for the 
aerial stage. 

3. 4. 1 Block Adjustment 

The block adjustment wa.s performed after the two strips of high- 
flight aerial photographs with ten photos each had been triangulated. 

Schuts triangulation and block adjustment programs were used for this 
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purpose. Coordinates were expressed in a secant plane system, with 
its origin in the test areas, to remove the influence of earth curvature 
The standard errors computed with the control points and tie points 
used in the adjustment proved to be 12.8, 10.3 and 4.4 m for Easting, 
Northing, and height, respectively. The planimetric errors correspond 
to a point identification error of about 0.1 mm on the photographic 
plate. The results can be considered very good in view of the 1/126,000 
scale of the aerial photography. , 

3.4.2 Production of Image Overlays 

The production of image overlays with annotated sample units 
both for the aerial photographs and for the EREP S-190 images took 
place as indicated in the flow diagram of Figure 9. The differences 
between procedures for space and aerial images are indicated by the 
appropriate boxes. 

Three types of points were annotated on the aerial and space images 
These were (1) the primary and secondary sample unit corner points, 

(2) the county line points, and (3) the management unit boundary points. 
The units outlined on the aerial images were GLO land sections and 
fractions thereof. All points were digitized from 10 USGS maps at a 
scale of 1/62,500. Geographic coordinates were assigned to each digi- 
tized point by means of an interpolation method, using a set of map 
control points with knbwn coordinates. The geographic coordinates were 
subsequently converted to secant plane coordinates. Elevations were 
read from the contour maps for all data points and were then added to 
the secant plane coordinates of the digitized points. Separate 
resections for the 20 high-flight aerial photographs were calculated 
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Figure 9. Flow diagram of space and aerial overlay manufacture 
for delineating sample unit boundaries. 
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using the control results from block adjustment. The data point 
coordinates were transformed to image coordinates by means of the 
resection results. The image coordinates were finally plotted on stable 
material templets using a Hewlett-Packard 9100A calculator plotter. 

An example of a high-flight aerial photo overlay is shown in Figure 10. 

As it turned out, there was a high degree of similarity between 
the U-2 images used in the ERTS investigation and the RB-57 images 
taken one year later for this EREP investigation. Both data sets 
were obtained at approximately the same time of year and both were 
available in the 9" x 9" color IR format. Therefore, rather than doing 
a separate resectioning on the RB-57 imagery, the SP corner points were 
visually transferred from the U-2 to the RB-57 imagery (Figure 10). A 
transferscope was used for this process. 

3.5 IMAGE ANNOTATION OF SCANNER IMAGES 

3.5.1 Resection Theory 

In contrast to the resectioning of aerial photographs, EREP S-192 
and Landsat images present two problems: (a) the image is produced 

by a multispectral scanner with a geometry different than that of aerial 
cameras, and (b) the imaging system has a very narrow perspective 
bundle. 

The first problem is in part solved by the NASA data processing 
facilities where the geometry of the MSS images is partially shaped 
to resemble the geometry of an aerial photograph, not considering 
relief displacement, but taking into account earth curvature. The 
second problem causes an instability in the resection. It can be 
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Figure 10. Examples of parcels annotated on a 1/126,000 scale high- 
flight photograph. 
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circumvented by enforcing some of the basic resection parameters, which 
must be determined from accurate outside sources such as the ephermeris 
data in the case of space images. VJe enforce the position of the 
simulated exposure station by assigning it the coordinates of the scanner 
picture center. 

To make the resection program for the annotation of the scanner 
images completely general, so that it could be applied to S-190 images 
as well , we took the approach in which any parameter can be enforced 
at the value of its initial approximation by assigning appropriate 
weights. This approach can be implemented by using the parameter appro- 
ximations in auxiliary equations in the linearized equation system: 

Ap = p°° - p° + e (l ) 

where: 

Ap is the correction to the approximation at the ith iteration 

p°® is the approximation of the parameter value 

p° is the parameter estimate to be enforced in the solution, and 

E is the difference between the least squares adjusted value of 
the parameter and the value to be enforced in the solution. 

For the first iteration, p®° is taken equal to p®. Then, if we place 
a large weight on the auxiliary equation, e will be close to zero, 

Ap will be close to zero, and the initial approximation, which is equal 
to the desired parameter val ue, wil 1 not receive any corrections in the 
iterative process. Thus, the initially assigned parameter value will 
remain unchanged throughout the solution. 
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The normal equations take on the following form for n data points: 
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where a , a^ a , a are the partial derivatives of the collinearity 

^ ^ y y 

equations with respect to the nine resection parameters for a particular 
point. 

The matrix has the following elements (only the first row is 
shown): 
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where e and e are the discrepancies resulting from an evaluation of 

X y 

the collinearity equations for the point under consideration with the 
current set of parameter approximations. 
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The matrix W is a diagonal 11x11 weight matrix with unit weights 
in the first two positions* zero weights for those parameters that need 
to be estimated, and large weights (for instance, 100 unit weights) for 
those parameters that are to remain constant. 

Thus, with the indicated solution we are free to fix or estimate 
parameters as needed. As the orbital path of the satellite is known 
quite accurately, the likely parameters to be enforced in the solution 
are the exposure station coordinates for the center of the image. For 
this purpose, the indicated latitude and longitude of the photocenters 
are taken from the best available source. For our Landsat investigation, 
we used the ERTS catalogue. To obtain the ERTS altitude, we prepared 
a program that computes the exposure station coordinates from orbital 
data furnished by NASA, given the Greenwich Mean Time (GMT) pertaining 
to the image center. However, we anticipated that the latitude and 
longitude indicated in the catalogue would be more accurate than the 
estimates computed by our program, since we only included first-order 
harmonic terms. In the solution, either the artificial focal length 
or the altitude needs to be enforced. We selected the altitude, since 
we assumed that scale change would be introduced in the bulk process 
through the introduction of an artificial focal length. 

3.5.2 Coordinate Systems 

The same secant plane coordinate system used for the block adjustment 
of the high- flight aerial photographs is used for the resection of the 
EREP and other space images. The conversion from the geographic 
coordinate system to the secant plane coordinate system yields coordinates 
of the control points in a cartesian coordinate system. The XY plane 
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of this system slices through the reference ellipsoid, so that the point 
elevations also reflect earth curvature. 

The earth curvature accounts for part of the perspective displace- 
ment encountered in MSS images so that the use of the col linearity 
equation for the along- track direction in conjunction with the secant 
plane elevations is, therefore, partly justified. 

3.5.3 Polynomial Fitting of the Residuals 

To eliminate systematic trends remaining in the point residuals 
after the resection has been performed, we included in the resection 
program a general polynomial fitting routine with which we could fit 
separate trend surfaces through the x and y residuals of the plate 
coordinates. This part of the program can then account for the 
unexplained remaining systematic distortions. 

For the polynomial surface fitting we use hybrid orthogonal poly- 
nomials in the x and y plate coordinates, generated with a recurrence 
relation. The maximum power of the polynomials is automatically deter- 
mined by the program and then discounted to evaluate all possible 
power surface fits. For each power. Root Mean Square Errors (RMSEs) 
can be computed to assess the goodness of fit. 

In the testing phase of the program we discovered that the 
polynomial fitting routine is general enough that, in terms of the 
residuals, almost identical results can be obtained by either performing 
a resection or by keeping all parameters fixed and then making the 
polynomial adjustment. Thus with the present program one can either opt 
for the classical resection technique or obtain the optimum polynomial fit 
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3.5.4 Experimental Results 


To perform the resectioning and produce the sampling unit overlay 
for the first S-190A image, we used the generalized program package 
described above, even though, here, we were not dealing with scanner 
imagery. The 70 mm image was: enlarged to a scale of 1/843,526 and 
transferred to a glass plate on which 34 control points were measured 
with a MANN TAPI comparator. The coordinates of these points, and 
ground control point coordinates obtained from the Coast and Geodetic 
survey were then, used in the spatial resectioning program. This program 
is capable of doing a classical resection as well as an empirical poly- 
nomial adjustment. The overall RHSE of the resectioning proved to be 
106 m on the ground. With an additional 2*^*^ power polynomial adjustment 
R^^5Es of 100 and 90.2 m were obtained in the x and y directions, respec- 
tively. These results are better by a magnitude of 2 than those obtained 
from ERTS, due to a better defined geometry, the use of SKYBET parameters, 
and a better ground resolution for control point identification. Examples 
of the annotation of the sample unit corners (GLO section and interior 
corners) can be seen in Figures 11 through 16. 

A similar procedure was used on all of the,S-190A and S-190B images. 
Unfortunately, because of delays in receiving the final S-192 data, 
combined with problems at our local computer center and cost limitations, 
we were unable to produce overlays for the S-192 images and data base. 
However, the process was successfully applied to Landsat imagery in our 
test area, and the qualities of this resectioning are given in 
Table 3. 
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Figure 11. S-190A color, June, 1973. 



Figure 12. S-190A color IR, June, 1973. 
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Figure 13. S-190A color, September, 1973 


Figure 14. S-190A color IR, Septeriter, 1973 
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Figure 15. S-190A color IR, June/September, 1973 composite. 



Figure 16. S-190B color, September, 1973. 
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Table 3 


RMSEs for Resection of Landsat 1 MSS Images 103 and 104 

(millimeters) 

(x 1000: meters on the ground) 




After polynomial adjustment 

Image 

Resection 
Resul t 

1 

iing^iii 

3 






RMSE for X 

0.142 

0.129 

0.123 

— 

RMSE for y 

0.146 

0.138 

0.116 


Resul tant 

0.144 

0.133 

0.119 

-- 

Image 104 
(30 points) 





RMSE for X 

0.233 

0.225 

0.217 

0.187 

RMSE for y 

0.196 . 

0.181 

0.176 

0.146 

Resultant 

0.215 

- - ■ . _ 

0.204 

0.198 

0,167 


We can see in Table 3 that the RMSEs vary from 0.119 to 0.233 mm. 

This variation ranges from one to two times the identification accuracy 
of 'he U-2 RC-10 photography (0.1 mm), which is reasonable considering 
tha’ resolution of the Landsat MSS images is considerably less than 
that of the RC-10 photographs. Thus we can conclude that the point 
identification accuracy is the limiting factor with respect to the resection 
quality. Extending this argument to the S-192 data, we can hypothesize that 
the RMSEs would fall between those for the high-flight aerial photography 
and the Landsat MSS images. On the other hand, the circular scanning 
mode of the S-192 instrument, with the need to convert to a rectangular 
coordinate system for resectioning purposes may introduce additional 







errors into the process, thus degrading the point identification accuracy 
to near the level of Landsat MSS imagery. Because of the problems stated 
above, we were not able to establish the exact accuracy attainable with 
the S-192 scanner, unfortunately. 

3.5.5 The Production of Image Overlays 

After the resectioning of the S-190 images was completed, the 
results were stored for subsequent use in the production of the image 
overlays. In producing the overlays, we used a second-degree residual 
estimation. Even though the benefits of this estimation were not clear- 
cut, we were convinced that the accuracy would not be degraded because 
of it. 

At the beginning of our investigation we decided to take 4x4 
blocks of one-square-mi le GLO land sections as the primary sampling 
units. Thus, one primary unit would cover approximately 16 square miles. 
This was consistent with the size of the psu's used in the earlier 
Apollo 9 experiment. 

Later on in our investigation, after having performed several 
resections Of space images, we concluded that there was sufficient 
geometric accuracy for the primary sampling unit to be as small as a 
one-square-mile GLO land section. Consequently, we used the poi nt data 
base developed for the annotation of the high- flight aerial photographs 
and projected these section corner points onto the S-190 images. All 
points were digitized from three maps at a scale of T/2-inch to the 
mile. Geographic coordinates were then assigned to each point by means 
of an interpolation method using a set of map control points with known 
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coordinates. The geographic coordinates were subsequently converted to 
secant plane coordinates. At this time no elevations had yet been 
assigned to the digitized map points. 

To produce elevations for the digitized points, we developed a 
digital terrain model of which a hypsocline chart is shown in Figure 17. 
This model covers an area of 125x125 km of our test area and includes 
part of the Sacramento Valley, the Trinity Alps and the Mount Shasta 
regions. In this region, the maximum terrain variation is 14,000 ft. 
because of the presence of Mt. Shasta. Elevations for the model were 
obtained by sampling an aeronautical chart with an 18x18 grid. 

A test of the model showed that the RMSE of the actual terrain 
around the model surface amounted to 330 m. Under the assumption that 
the space image has a perspective geometry (including relief displacement), 
it can be shown that elevation errors in the order of 330 m would induce 
plate position errors of 37 micrometers on Landsat imagery, whereas, 
errors of 111 micrometers would be incurred by assuming a mean terrain 
elevation. 

After the elevations were assigned to the digitized points, they 
were processed through a program that sorts the points by frame number 
and projects them onto the space images in a rectangular coordinate 
system defined by the registration marks, if they exist. These coordi- 
nates were then plotted on stable transparent material with a Hewlett- 
Packard 9100A calcul ator-plotter at an enlargement ratio calculated by 
measuring a set of known distances. The resulting overlays were then 
attached to the S-190A and S-190B images in preparation for photo 
interpretation. 
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Figure 17. Hypsocline chart of Digital Terrain Model. 
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4.0 S-190 SPACE AND HIGH-FLIGHT AERIAL INTERPRETATION MODELS 


4.1 INTRODUCTION 

In the type of multistage sampling scheme described In this report, 
measurements made at adjacent stages of the design are tied together by 
models that ultimately related remote sensor data to resource measurements 
obtained on the ground in sample areas. For example, in variable 
probability sampling, the model provides predictions which are used to 
determine unequal probabilities for drawing a sample in the next stage. 

In regression sampling, the model is used to adjust extensive but 
relatively inexpensive measurements made at the aerial stage with fewer 
data obtained from the more expensive ground stage. In stratified 
sampling, it is used to define strata in which independent ground 
sampling will take place. 

Certainly the sampling method used is an important factor with 
regard to the overall level of precision obtained in the survey. But 
the interpretation model is of crucial importance since it governs the 
transfer of information from the image into the sampling design, and 
thus makes or breaks the quality of the survey assuming the transient 
data is adequate. 

Colwell (1965) observed that photo interpretation entails two 
kinds of operations: (a) observing such photo-image characteristics as 

size, shape, shadow, tone, texture, and location, and (b) judging the 
significance of the features, based in large measure on their inter- 
relationships or "association." 
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Colwell noted that while a machine may have the capability to 
do the former, rarely is it capable of doing the latter sufficiently 
well. On the other hand, while a human being excels at judging asso- 
ciations, he may not be able to do a consistent job of interpreting 
features such as tone and texture from large amounts of image data, 
as he is soon overcome by boredom and fatigue. 


At the beginning of our investigation we were aware that image 
features most highly coorelated with biomass volume would be the spatial 
distribution of tone and texture, and that large amounts of data would 
have to be interpreted. In addition, human interpretation would not 
be entirely capable of extracting the maximum amount of information 
from multi-channel images such as provided by the S-192 scanner system. 


In view of these considerations, most of the human image interpre- 
tation effort was concentrated on relatively simple interpretive models 
appropriate to the S-190A, S-190B, and the high-flight aerial photography 
Photo interpreters were able to see individual trees on the aerial 
photographs and they were able to associate their observations with 
ground experience obtained in the area. Vegetation boundaries could be 
seen on the S- 190 photography. Even so, the interpretation process was 
sometimes difficult since it required a substantial amount of subjective 
judgment both for the aerial and space system photography. 


Before embarking on the human image interpretation effort, however, 
we deemed it advisable to subject a sample of the imagery types, 
including those from the S-192 scanner, to a simple mul tispectral 
additive viewer and a density slicer. The purpose of this was to 
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determine, in a cursory way, the possibility for achieving gains in 
sampling precision by virtue of obtaining better quality interpretive 
data related to forest variables. These simple trials and their results 
are described in the next section. 

4.2 MULTISPECTRAL COI^INING OF EREP IMAGES 

We examined S-190A photographic and S-192 scanner type images from 
Skylab passes #34 and #37 over our Trinity County, California test area. 
Instruments used during the image analysis were the I S (International 
Imaging Systems) Addcol and the VP -8 Image Analyzer from ISI (Image 
Systems International). In this section, we will describe the tech- 
niques used with each machine, the image materials used and the results 
obtained. 

4.2.1 I^S Addcol 

This machine projects light through four color optical glass filters, 
one for each of four image samples. The samples consist of selected 
bands of the multi spectral coverages of the S-190A imaging system. 

The four image samples (70 mm black-and-white positive transparencies) 
are viewed first as one composite image without filters on a diffuse 
glass viewing screen. A color optical filter is then placed between 
each of the projection lights and corresponding image sample. The 
result is a color composite image on the viewing screen. For enhancement 
purposes the light intensity and filter color can be varied for each 
image sample. When the desired image is produced on the viewing screen 
the image is copied from the screen photographical ly using Ektacolor 
or other comparable film. Our experience with the S-190A imagery 
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indicated that the following combinations of light intensity setting and 
color filter were best for interpreting forest areas: 


S-190A 
Sample Image 

Bandwidth (y) 

ih Addcol 
Light 
Intensity 

Filter 

Roll 1 

.69 - .80 

9.1 

Red 

Roll 2 

.77 -1.00 

8.7 

Red 

Roll 5 

.58 - .80 

8.5 

Green 

Rol 1 6 

.47 - .58 

10.0 

Blue 


The resulting image provided the greatest contrast and enhancement, as 
judged visually, which could be obtained with the sample images. Our 
objective was to enhance the existing forest stands in contrast to non- 
timbered and barren lands over our test area. 

The same procedure was used with S-192 screening images from 
SKYLAB pass #37- In this case we had only three image samples which 
represented the spectral bands and bandwidths of concern to us in this 
investigation. Light intensity settings and filter combinations used 
for each image sample were as follows: 


S-192 

Sample Image 

Bandwidth 

I^S Addcol 
Light 
Intensi ty 

Filter 

Band 7 

.78 - .88 

8.7 

Green 

Band 11 

1.55-1.75 

8.5 

Red 

Band 2 

.46 - .51 

7.8 

B1 ue 


Our observations of the composite images produced for the S-190A 
and S-192 image sample have led us to the following conclusions. In 
the case of the S-190 composite image we observed a loss of resolution 
by combining the images and projecting them onto a ground diffuse 
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glass viewing screen. Second or third generation color infrared 

transparencies delivered as a SKYLAB product are several orders of 

magnitude better for interpreting photographic detail in the forested 

areas of Trinity County. In addition, we found that color enhancement 

2 

of the several bands via the I S machine did not produce equally as 
good or better color contrasts for interpretation of forest vegetation 
over our test area as did the color infrared film transparencies. 

In case of the S-192 images, the photographic quality of the 
images received was fair to poor. However, we can make a few comments 
regarding our observations of the utility of the S-192 scanner as an 
imagi ng system. 

Band 2 (.46 - .51 y) has wavelengths too short to be useful in 
forest vegetation analysis. Detail is obscured and contrast among 
features is almost nil. 

Band 7 (.78 - .88 y) was the best of the three bands tried for 
vegetation analysis. Detail is clearer and contrasting vegetation 
features are interpretable by carefully studying the image. 

Band 11 (1.55 - 1.75 y) seems to produce an image similar in appearance 
to a radar image of terrain. Geographic features show some contrast, 
water bodies and water courses show sharp contrast and detail. Detail 
in these features is readily interpretable, but for vegetation analysis 
this portion of the spectrum did not appear to be appropriate for our 
test site in California. 
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4.3 VP-8 IMAGE ANALYZER 


The ISI VP-8 density slicer utilizes a video camera and television 
monitors. Between the camera and receivers are the electronics which 
convert the video signal to digital mode and measure the film density 
responses from several variable-width bands (slices). In addition to 
obtaining measurements of the proportion of an image exhibiting each 
density level, one can make point density measurements at any point on the 
image by means of an electronic cursor. Each point is uniquely iden- 
tified by a pair of X, Y coordinates that can be read on the instrument 
panel. Hence, any point on the image can be measured for density so 
long as the image is not moved with respect to the video camera. 

The more usual method of employing a density slicer in vegetation 
analysis work is to partition the image into grey levels by means of 
the system electronics. Beginning at the light er dark end of the 
density scale, one can '^paint" the screen with light, progressing from 
one end of the scale to the other by means of a knob on the panel. In 
the VP-8 system used in this investigation, there are eight separate 
knobs with the accompanying electronics. Hence, the image can be 
partitioned into eight separate grey levels simultaneously. In addition, 
each of the eight density levels can be displayed in a unique color on 

i i • . 

a video screen. After properly adjusting the density levels , the 
proportion of screen occupied by each color can be read on the machine 
console. Therefore, the relative area of each vegetation type that can 
be uniquely incorporated within a density slice is measurable. It was 
by using this technique that we evaluated the usefulness of the VP-8 

57 



in forest sampling when applied to a single EREP S-190B black-and-white 
infrared image. 

The S-190A frame number 058 from the black-and-white infrared film 
strip was the photographic medium used. This frame was first enlarged to 
a scale of 1/500,000 and printed on a transparent film base for use on^the 
VP-8 light table. The control points used in the previous resectioning of 
this frame were located on the image. Then a resectioning overlay was 
produced showing the corner locations of the one-square-mile sampling units 
The overlay was positioned on the image and placed on the VP-8 light table. 

The next step was to calibrate the instrument to the density range of 
the transparency and the number of density bands desired, four in our case. 
A color is assigned to each level, ahd, finally, the proportional readouts 
on the instrument are calibrated to an area of known size on the image. 

After calibrating the instrument to the image, we read the propor- 
tion of each sample unit occupied by each of four chosen colors. The 
density threshold levels were set to conform to four classes of timber- 
land categories based on a small training set of sample units within the 
area of the inventory. The definitions of density level, assigned color, 
and corresponding expected timberland classes are given below. 



Density 

Level 

Color 

Expected 

Timberland Class 

1 

(li ghtest) 

Orange 

Bare land 

2 

1 


Green 

Low vol ume-- scattered trees 

3 

\ 

f 

Violet 

Med. volume— open stands 

4 

(darl 

<est) 

Blue 

High vol ume--dense stands 
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The proportions of a sample unit in each of the three colors were 
recorded and considered as possible independent (X) variables in a 
regression model. The estimated volume of timber as determined from the 
SP inventory was entered as the dependent (Y) variable. The data 
format and sample unit identification is shown in the ful lowing 


example: 

Estimated 

% 

% 

% 

% 

Sample Unit 

Timber Vol. (FBM) 

Orange 

Green 

Vi ol et 

B1 ue 

Y 

Xl 

X2 

X3 

X4 

T37N R9W S5 

10,959 

32 

26 

18 

14 

4. 3.1 Results of 

VP- 8 Trials 






Two trials were made, each one consisting of 34 primary sample units, 
one square mile in size, using a linear multiple regression model of the 
general from Y = X3 + e. These two regression trials yielded the following 
statistics. 

Tabled 



Explained 

Signi 

f i can ce 

of X Variables 

Val ue of 

Tri al 

Variability (y^) 

0 range 

Green 

Violet 

B1 ue 

^30 

1 

.11 

1.36 

.76 

-1.2 

.17 

1.70 

2 

.17 

2.12 

.45 

-.56 

-.98 

1.70 


As it can be seen in the above table, the maximum percentage of the 
total variability explained by this simple model was 17 percent, a value 
generally too low to be economically useful in sampling applications. The 
variable of highest significance was "orange" which corresponds to the light 
tone on the black-and-white infrared transparency which would relate to 
healthy vegetation. 
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The model was also evaluated for the gain in sampling precision 


that can be achieved under different sampling procedures using a method 
developed by Zarkovic (1964). The results are tabled below: 


Sampling Method 
(First Stage) 

Stratified 

Variable probability (pps) 
Regression 


Percent Gain in Precision 
Trial 1 Trial 2 

5.93 12.98 
2.27 7.49 
11.48 16.53 


In the above analysis, the gain in precision is analogous to the 
reduction in the sample size required to obtain the same variance. 

It appears from the above that regression sampling would be best when 
using the VP-8 instrument in the manner described above. The 
relatively low gain from pps sampling could be attributable to (1) a 
lack of sensitivity of the VP-8 responses to variations in tree sizes, 
or (2) the regression of Y on X not passing the origin. Under sensitive 
but stable conditions, the pss estimator is capable of greater gains’ 
than regression or stratified sampling and is applicable with small 
Sample sizes, which is not the case with regression sampling. The 
economic significance of these results in relation to sampling 
efficiency is given in section 4.5 below. 


4.4 HUMAN INTERPRETATION MODELS 


4.4.1 Space Imagery 


Throughout the course of the investigation, and as S-190 imagery 
was received from NASA, Houston, human image interpretation trials were 
conducted in the Trini ty County test area. The technique used was 
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parallel to that used with the VP-8 image analyzer, except that color 
images were employed by the human interpreters. 

The first human interpretation trials which were applied to the 
imagery received from SKYLAB II utilized 1/500,000 scale enlargements. 
Later on, a more complete analysis was made of SKYLAB II and SKYLAB II 
color imagery enlarged to a scale of 1/125,000, 

In all of these trials, the approach, was the same. First, trans- 
parent overlays were generated showing the location of the one-square- 
mile primary sample units on each image. Second, interpretations were 
made of the proportion of each sample unit exhibiting each of three or 
four distinct color tones. Regression models were formulated using the 
proportion of the sample units in each color as the independent variables 
The results of these runs were screened and the ones showing the most 
promising results Were tabled. These are shown below. 

Even though up to four color class separations were made during the 
course of the investigation, three only seemed to be the maximum number 
that contributed significantly to any model. Also, the intercept was 
removed from all models without any significant reduction in the 
multiple correlation coefficients. 

In interpreting the color imagery, the color classes showing the 
most significant relationship to our estimated timber volumes are: 

Emulsion Type Most Significant Colors 

Conventional color Dark green, light green, tan 

Color infrared Deep red, bright red, blue 
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4.4.2 High-FI ight Aerial Photography 


During the course of our SKYLAB and ERTS investigations, we also 
conducted human photo interpretation trials to evaluate the usefulness 
of high-flight aerial photography in forest inventory work and to 
compare these results with those obtained with the SKYLAB EREP S-190A 
S-190B photography. 

Our aim was to develop a regression type interpretation model for 
which the independent variables could be interpreted from the high- 
flight photographs. The dependent variable used was timber volume per 
square mile as determined from our survey of SP timberlands in Trinity 
County. In comparing the results of these trials with those obtained 
from interpreting EREP S-190 data, it should be kept in mind that indi- 
dividual trees are discernible on high-flight aerial photography and 
not on the EREP photography. 

We started out with the interpretation of eight variables. These 
were (1) the percentage of a parcel on southern exposure, (2) the 
percentage of southern exposure covered with coniferous forest, (3) crown 
density of the conifer covered portion, (4) percentage of large trees 
on this portion, and the variables (5) through (8) being a repeat of the 
first four variables for the northern exposure. 

In this first interpretation experiment we interpreted black-and-white 
prints of the U-2 color infrared photographs. Two interpreters estimated 
the 8 variables for the 40 sections twice. The initial idea was to 
average the four sets Of interpreted data to reduce their variability and 
to reduce interprecer bias. For this purpose we did an analysis of 
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variance to determine if there was a significant difference between 
interpreters, and to test the homogeneity of variance among the different 
data sets. The outcome of these tests showed that the data could be 
averaged. Special care was taken to make the data consistent by using 
the conference interpretation system, where common standards for the 
interpretation were continually established. 

Using the averaged eight variables, we tried six different models 
incorporating linear and non-linear combinations of the variables. 

The outcomes did not differ significantly from model to model. The 
multiple correlation coefficient was in the order of 0.65 and the 
standard error of the estimate was of the magnitude of 2.5 million 
board feet (bd. ft.) per square mile. We discovered, however, that we 
could economize on the interpretation effort by averaging the variables 
for the northern and southern exposures, as this distinction apparently 
did not contribute to the overall result. , 

In the next experiment the number of variables was reduced to 
three; namely: (1) percentage of the section covered by coniferous 

forest (C) , (2) crown cover density of the coniferous forest (D) , and 
(3) the percentage of the stand in large trees (L). 

In addition, we investigated two other factors; the difference 
between the use of black-and-white or color infrared photographs, and 
the difference between interpreters. The results of the second 
experiment in which we tried three models, two interpreters, and two 
kinds of photography are given in Table 4. 
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Table 4 

Results of the High-Flight Photo Interpretation Experiment 


Model 

Black-and-White 

Color Infrared 

Int. 1 Int. 2 

Int. 1 

Int. 2 

■ 


Multiple correlation coefficient 

H 

V=MkD+CD2+CL+CL2 

0.71 

0.74 

0.64 

H 

V=M+CD+Cd2 

0.58 

0.58 

0.53 

1 

v=m+c|d+d2+l+l2] 

0.72 

0.74 

0.65 


In Table 4 there are no entries for the second interpreter under 
black-and-white photography, since this interpreter was no longer 
available. However, it can be concluded from the color infrared inter- 
pretation that there was indeed a significant difference between 
interpreters . 

The second conclusion that can be drawn from the experimental 
outcomes is that the difference between color infrared and black-and- 
white photography is very small, so that the type of photography used 
does not seem to be an important consideration in this case. 

The variable percentage of large trees (L) was a source of 
complaint during the interpretation work. Highly subjective judgment 
was required for its interpretation. We, therefore, omitted this 
variable (model 2) to determine its influence on the multiple correlation 
coefficient. The decrease in this coefficient, however, indicated that 
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the variable made an important contribution and, thus, it was included in 
the final model. 

The third model was the final one selected for testing in a sample 
survey situation. 

4.5 SUMMARY OF RESULTS 

In Table 5, we have summarized the most significant results of the 
S-190 and high-flight aerial image interpretation trials as they relate 
to timb'^r volume estimation in Trinity County employing one-square-mile 
primary sampling units. 

The most important results from the sampling standpoint are the 
relative gains that could be achieved using the different sampling 
methods, (1) stratified, (2) probability proportional to size (pps), 
and (3) regression. The gains listed in the table are in relation to 
what would be achieved under simple random sampling. The reason for 
this comparison is that under simple random sampling, primary sample units 
would be drawn without benefit of any prior information concerning the 
resource base, in other words, without taking advantage of the inter- 
pretive data available from the aircraft or space imagery. All of the 
three plans listed do take advantage of the image data but in different 
ways. Therefore, any one of the three methods may be most appropriate 
in specific situations. Interactions among sample units, the interaction 
between the remote sensor and ground data, the sample size, and even the 
experience of the interpreter in the local area can affect the best 
selection of sampling method. Nevertheless, it can be seen from the 
table that all three of the sample methods shown do take some advantage 
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Table 5 

Selected Statistics from the Space and Aerial Image Interpretation Trials Employing 
One-Square-Mile Sample Units and Comparison of Alternative Sampling Methods 








1 


1 

Image 

Type 

Month 

of 

Aqui- 

sition 

Image 
Seal e 


Sample 

Size 

Proportion of 
variability 
explained by 
the model (r^) 

Average % Gain in Sampling 
Precision Compared to 
Simple Random 



IBB 


Stratified 

pps. 

Regression 

S;-190A 
BS.W IR 

June 

1/500,000 

VP-8 

34 

.14 

9.5 

4.9 

14.0 

S-190A 
Color IR 

June 

1/500,000 

Human 

114 

.38 

23.4 

30.5 

38.4 

S-190A 

Color 

June 

1/125,000 

M 

170 

.24 

15.0 

21.3 

24.4 

S-1 90A 
Color IR 

June 

1/125,000 

II 

184 

.25 

17.3 

22.8 

25.3 

S-190A 

Color 

Sept. 

1/125,000 

n 

184 

.38 

28.1 

32.1 

38.8 

S-190A 
Color IR 

Sept. 

1/125,000 

It 

185 

.36 

25,2 

31.9 

36.2 

S-190A 
Color I R 

June/Sept. 

Composite 

1/125,000 

11 

140 

-.43 

35.1 

39.5 

43.3 

S-190B 

Color 

Sept. 

1/125,000 

11 

186 

.37 

27.8 

35.6 

37.3 

High - 
Flight 
Aerial 
Color IR 

June 

1/125,000 

ii 

40 

.55 

43.4 

49,9 

55.1 



Average Gain 

(Human Interpretation 

of Space imagery) 
)tandard Deviation 

24.6 

30.5 

34.8 




c 

6.8 

6.5 

7.1 














of all kinds of imagery in increasing the precision of a forest inventory 
for a given sample size. 

In evaluating the economic benefit of EREP S-190 imagery when used in 
this manner, one must consider the cost of obtaining and interpreting the 
imagery in relation to the cost savings achieved by virtue of the 
reduction in the required sample size. 

From Table 2, page 27, it can be determined that in 1972, the cost 
of utilizing two 70 mm strips and one crew day, to obtain tree measure- 
ments, per one-square-mile primary unit was $909.48 (2(267.24) + 375.00). 
Therefore, an allowable reduction of 20 % in the sample size in a situation 
requiring 50 primary sample units would result in a sampling cost saving 
of $9,094.80 that could be credited toward obtaining and interpreting 
the space acquired data. Table 6 lists the revenues that would be 
available for image acquisition and interpretation when employing selected 
S-190 products into the design, assuming a base of 100 psu's under simple 
random sampling. 

On the other hand, there are other data types that are sometimes 
available, such as aerial photography, that can be used instead of space 
acquired data. Therefore, the feasibility of implementing a satellite 
remote sensor program must heavily depend on the multidisciplinary 
aspects of the program, the size of the area covered, the consistency of 
the data, and the frequency of coverage. It might be appropriately 
noted here that, in the forest inventory business, there is always a 
shortage of good quality imagery covering the right area at the right 
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Tab! e 6 


Revenue Made Avail able for Image Interpretation under Three Sampling Methods Compared 
to a Base of 100 Primary Sample Units under Simple Random Sampling 
(Figures assume a subsampling plan of 2 low altitude 70 mm strips 
and one crew day on the ground per psu, 1972 data) 


Image 

Month of 

Cost of Data Acquisition* 
within P ri ma ry Un i ts 
(Thousands of Dollars) 

Revenue Made Available 
for Image Interpretation 
(Thousands of Dollars) 


Acquisition 

Stratified 


Regression 

Strati fi ed 

pps 

Regression 

S-190A 

Color 

June 

77.3 

71.6 

68.8 

13.6 

19.4 

22.2 

S-190A 
Color IR 

June 

75.2 

70.2 

67.9 

15.7 

20.7 

23.0 

S-190A 

Color 

Sept. 

65.4 

61.8 

55.7 

25.6 

29.2 

35.3 

S-I90A 
Color IR 

Sept. 

68.0 

61.9 

58.0 

22.9 

29,0 

32.9 

S-190A; 
Color IR 

June & Sept. 
Compos i te 

59.0 

55.0 

51.6 

31.9 

35.9 

39.4 

S-190B 

Color 

Sept. 

65.7 

58.6 

57.0 

25.3 

32.4 

33.9 


*Cost under simple random sampling for 100 psu's is 90.9 thousand dollars. 




time. Amelioration of these problems alone would go a long way in 
advancing the art of resource information management. 

There are several significant results that can be observed in 
Table 5. 

a. The results from the VP-8 experiments were disappointing. 
Although some gain was achieved, considerably more effort 
would be needed to develop this type of instrument into 

a viable competitor in forest survey work of the type 
investigated here. 

b. The S-190A products obtained in September yielded 
significantly higher gains thanthe products obtained in 
early June. This was because of a greater spectral 
difference between commercial conifers (evergreen) and 
noncommercial broadleafs, brush, and grass in September 
compared to June. 

c. The color composite incorporating June and September 
S-190A imagery yielded the highest gains of any of the 
space acquired data tried. Therefore, temporal varia- 
tions were important in our analysis (i.e., two timely 
coverages are significantly better than one). 

d. There was no significant difference between the gains 
achieved with S-190B imagery compared to S-190A imagery. 

e. There was no significant difference between S-190A color 
versus S-190A color IR. 
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f. The gains in sampling precision were significantly better 
with the high-flight aerial imagery than with any of 
the space acquired data. This is even more significant 
when considering that the June S-190A imagery, when the 
aircraft data was acquired, gave poorer results than the 
September S-190A products. Therefore, the capability 
to resolve trees yields significantly better results 
and is more important than seasonal variations. 

We have concluded that some of the reasons for the similarity in 
the results of the S-190A and S-190B image interpretations is that 
significant vegetation type boundaries can be delineated on all the space 
imagery used and trees are discernible on none of them. Because of 
this, important data concerning the density of commercial species, 
bole sizes, and tree heights cannot be ascertained. Therefore, the 
conclusion must be drawn that the EREP S-190 data is of significant , 
but limited usefulness in forest inventories using human interpretation 
methods. The general conclusion is that the number of primary sample 
units required to achieve a given level of variance can be reduced from 
24 percent to 43 percent by introducing EREP type information into the 
sampling design. This general conclusion was indicated in the Apollo 9 
experiment, our ERTS investigation, and re-verified in the present 
investigation. To achieve further gains from space systems, and to 
obtain a capability for forest stand mapping for management purposes, 
it would be necessary to increase the resolution to a point where 
trees are discernible. At that point, a new plateau in capability 
would be reached. 
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There are a few other important points, both pro and con, that come 
out of the investigation that are narratively commented on. below. These 
comments are in line with Mr. Ryborn Kirby's letter of April 1, 1975 
to the Principal Investigator. 

(a) The higher spatial resolution of the S-190B appears to us to 
have definite advantages over the S-190A system both from the vegetation 
mapping standpoint and from the operational standpoint. From the 
mapping standpoint, vegetation boundaries are more clearly defined and 
hence could be mapped to a greater degree of accuracy. As a result of 
this, acreage measurements would be more accurate. This may or may not 
increase the precision of timber volume estimates depending on the 
sampling method used. 

(b) The improved spectral discrimination of the S-190A system did 
not seem to contribute significantly to our investigation. All of 

the major vegetation type differences could be discriminated on all the 
image types. The differences were simply a matter of degree. By and 
large, the infrared color imagery is the best single product. It cuts 
the haze best and differentiates between commercial conifer areas and 
non-commerci al hardwood and brush areas. Color infrared with a 
capability to resolve trees, such as with FiB-57 high-flight photograr'-y, 
would be super. 

(c) The consistency of sun angle is probably one of the more 
important attributes attainable with space imagery for forest survey 
work. Consistency over space and time is extremely important in 
allowing the development of machine assisted interpretation models 
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and techniques. In past experiments aimed at determining the feasibility of 
automating forest photo interpretation with aerial photography, the variations 
in reflectance attributable to variation in sun angle and topography were 
always a major cause of variability. Therefore, maintaining a steady sun 
angle by latitude would be beneficial. Seasonal variations, however, are 
an aid to forest image interpretation because grass, brush , broadleaf, 
and conifer vegetation all undergo differential changes in their 
reflectance characteristics over time. These differences can be 
capitalized on in developing machine-assisted interpretation models. 

In our investigation, we found significant differences due to seasonal 
variation as shown in Table 5. 

(d) Improvements in resolution, particularjly if trees could be 
resolved, formatted to a larger scale, say 1/125,000, would probably 
be of more benefit in forest surveys than further improvements in 
spectral discrimination. High resolution also contributes to the 
operational efficiency by facilitating the identification of sample areas 
from aricraft and on the ground in multistage surveys. This is an 
important consideration from the standpoint of operational efficiency. 

(e) There is no establish standard of precision for forest surveys. 

It usually varies by the size of the area to be surveyed and the cost of 
making a wrong decision. The Forest Survey maintained a standard of 
precision of 5 percent per million acres for area estimates and 10 
percent per billion board feet for volume estimates evaluated at the 

67 percent confidence level, Many private owners try for a standard 
of 10 percent on total volume evaluated at the 95 percent confidence 
level. The main significance attributable to the level of precision 
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is the survey cost to achieve that precision balanced against the cost 
of a wrong decision that may be attributable to inadequate information. 

(f) We feel that the most useful single SKYLAB-type product for 
forest surveys is the S-190B imagery enlarged to a scale of about 
1/125,000. If better resolution could be obtained, color enlargements 
to a scale of 1/63,360 (1" = 1 mile) would be extremely useful in 
forestry work, particularly if they could be supplied at frequent inter- 
vals of once or twice a year. This type of product would assist 
managers in updating harvest areas and generally keeping track of changes 
in the resource base. This is one of the biggest single problems in 
resource information management. 

(g) SKYLAB-type data can almost provide a remote sensor program 
geared to land management but not quite. We feel that an increase in 
resolution is necessary for widespread application. Then, the application 
would be tremendous for resource mapping, monitoring, inventorying, and 
management. 
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5.0 DIGITAL INTERPRETATION MODELS 


One of the initial objectives of this investigation was to develop 
S-192 digital interpretation models expressly for tinter volume estima- 
tion by one-square-mile sample units. We did, in fact, develop the 
programs to the point where the sample units can be isolated in the 
computer ready for analysis. And we did produce shade prints covering 
our test area (Figure 18). Unfortunately, a change from the Univac 1108 
Exec II to the Exec VIII monitor system at the ISD computer center, at 
the crucial time when the S-192 tapes were finally received from Houston, 
made it impossible for us to complete the digital model investigation 
within the remaining time and available funding. 

The description of the analysis system, which is now nearly ready 
to be implemented using S-192 data, was included in our ERTS report to 
Goddard Space Flight Center, Greenbelt, Maryland. For a complete 
description of the digital MSS timber volume interpretation system, see 
section 4.3, pages 39 through 56 of the final report, "Investigation to 
Develop a Multistage Forest Sampling Inventory System Using ERTS-1 
Imagery," ERTS Investigation #174. 

Since the S-192 scanner system employs many more channels than the 
ERTS system, the task of screening to isolate the channels most appro- 
priate for forest inventory purposes could be a formidable one. On the 
other hand, the greater selectivity of channels and the greater 
resolution of the S-192 scanner compared to the LANDSAT MSS could result 
in a more significant interpretive system. This would be an appropriate 
subject for a future investigation. 
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Figure 18. Shade print of test area produced from S-192 digital tape. 
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APPENDIX A 


THE THEORY OF MULTISTAGE VARIABLE PROBABILITY 
SAMPLING WITH REPLACEMENT 

General Description 

Sampling with unequal probabilities has been known to statis- 
ticians for some time. It is often referred to as sampling with 
probability proportional to size (pps). Hansen and Hurwitz (1949) 
applied it to census surveys and since that time various forms of 
the method have been described and used by many people. A notable 
example in forestry is due to Grosenbaugh (1958) who demonstrated 
that Bitterlich point sampling with a prism or angle gauge selects 
trees with probability proportional to basal area. Several years 
ago, Langley envisioned the use of pps sampling in timber 
management inventories and taxation surveys when prior information 
concerning the target population was available from file records 
or aerial photographs. 

In the early 1960's Langley suggested the use of high alti- 
tude photographs as a means of increasing the efficiency of forest 
surveys that covered large areas (Langley, 1961). Later, he 
thought about the possibilities of incorporating photointerpreta- 
tion data in timber inventories by means of one-stage pps sampling 
In about 1965, he considered the possibilities of incorporating 
multistage pps sampling into forest surveys. However, it was not 
until the Douglas-fir mortality surveys of 1967 (Wert and 
Roettgering, 1968) that a pressing incentive was provided to 
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develop the idea more fully. The Apollo 9 experiment in 
1969, the Southern Pacific Inventory of 1971, the NASA ERTS-1 
experiment, and finally the present SKYLAB/EREP investigation 
provided the incentives to complete the derivation of the 
optimum allocation procedures as presented in Appendix C. 

In the context of this paper, sampling with replacement in 
the multistage case means that each time a unit is drawn repeti- 
tively in any stage, a new independent subsample must be drawn in 
all subsequent stages. 


O ne-Stage Variabl e Probability Sampling 

To illustrate the method, suppose we write a one-stage random 


sampling estimator for total forest volume on a tract of land as 



n 




in which 

v^. is the measured timber volume on the i^*^ sample 
unit, 

n is the number of units included in the sample and 
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N is the number of units in the target population. 
Formula (1) is, of course, equivalent to 


„ 1 V / * 


From sampling theory, we know that is the probability of 

■f* h 

selecting the i ^ pupulation unit at any given draw. Therefore 


-I n V . 

V = 1 J -± 

" 1=1 p 


in which P “ ^ ''S the constant probability of selection. Each 
ratio v^./p estimates the population total. 

If the probability of selection is derived from some other 
criterion, such as predicted timber volume derived from aerial 
photographs, we may write 


P. = C./,I^C. 


in which C. is the predicted volume of timber in the i^^ popula- 
^ N 

tion unit. Since J P. = 1.0, we satisfy the requirements of a 

i=l V 

probability distribution. The new estimator, known as the pps 
estimator, may be written as 


-1 y ^ 
n >1 P| 


( 2 ) 


This estimator is unbiased as long as the observations are physi- 
cally drawn according to the prescribed probabilities. This is 
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shov/n by evaluating the expected value, i.e. 


n V 


E(v) = E[-E I a] 

p.-" 


N V. 

= y p -- 
Pi 

N 

= I V. 
i--lV 

= V 


where V is the population parameter sought. 

The sample selection is easily made by drawing n uniformly 

distributed random numbers with equal probability from a list of 

N 

numbers in the interval T to I C. . When selection is with 

i=l ^ 

replacement, a forest unit is included in the sample once each 
time a random number falls within its range on a cumulative list 
of the C.'s (Table A. 1, Col. 4); i.e. whenever 


k k+1 

I Ct < r . < I C. 
i=l V J i=l ^ 


where r. is the number selected at the draw. For example, 

J . 

if we select three numbers at random between 1 and 154 (T54 is 
N 

y C. in Table 1), and happen to draw 12, 85, and T30, v/e have 
i=T ^ 

selected units 2, 5, and 7 to be measured. This can be seen by 
referring to Col . 4 of the table. The first number, 12, is greater 
than TO, the upper bound for unit 1 but less than 40, the upper 
bound for unit 2 and so on. Furthermore, the samples ha*/e been 
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Table A.l 


Illustration of the Reduction of the Sample Variance with Variance Probability Sampling 



Variable Probability Sampling 


Equal Probability Sampling 


: (1) 

i 

1 Uni t 
■ (i): 

i 

(2) 

T imber 
Volume 

(vp 

(3) 

Predicted 

Volume 

(cp: 

(4) Jl 

Cum. Sum'] 
N 

Tg- t 

ih: ■ 

(5) 

c. 

P = 

1 EC^. 

(6) 

vpp, 

(7) 

(V./P.-V)^ 

(8) i 

i 

P.(V./P. - V)^, 
111 ' 1 

(9) 

P=l/12 

(10) 

vpp 

(11) 

(vpp-v)^ 

(12) 1 
2 ' 

P(v./P-V)'' 

1 \ 
r 

! T 

8 

10 

10 i 

.0649 

123 

729 

47 I 

.0833 

96 

2916 

243 : 

t 

; 2 

34 

30 

40* ! 

. 1 948 

175 

625 

122 S 

j 


408 

66564 

5545 ; 

: 3 

18 

15: 

: 55 ! 

.0974 

185 

1225 

119 


216 

4356 

363 i 

i 

; 4 

14 

20 

75 1 

I 

.1298 

108 

1764 

229 

fi 

168 

324 

27 1 

' 5- 

45 

50 

125* j 

.3247 

139 

121 

39 

ii 

540 

152100 

12670 : 

j 

i 6 

2 

1 

126 1 

.0065 

308 

24964 

162 

M 

24 

15876 

1322 ! 

j 

: 7 

8 

10 

136* j 

.0649 

123 

729 

47 


96 

2916 

243 j 

is 

3 

5 

141 j 

.0325 

92 

3364 

109 

Ci 

36 

12996 

1083 1 

! 9 

6 

5 

146 1 

.0325 

185 

1225 

40 

Ii 

72 

6084 

507 i 
1 

i 10 

7 

5 

1 51 1 

.0325 

215 

4225 

137 


84 

4356 

363 i 

hi 

0 

1 

: 152 ^ 

! .0065 

0 

22500 

146 

M 

0 

22500 

1874 : 

i ^2 

:hS 

2 

: 154 

! .0130 
1 

385 

55225 

718 

U 

60 

8100 

675 ; 

Total 

T ■ 

1 50 = V 

154 

jh.OO 



1915 

1 


i 24,915 1 


'The C-j's shown in column 3 are closer to the V-j's in column 2 than vrauld normally be obtained in prac- 
tice unless they had been corrected by, say, regression. However, the correlation coefficient R ivhich 
is equal to .97 in this example, is of the same magnitude reported by Hear et al (1964) and therefore 
serves as a valid, though optimistic, example for demonstrating the use of photo predictions in variable 
probability sampling. 



selected with probability proportional to the photo estimates 
because unit 2 has <10-10 = 30 chances out of 154 of being selected, 
unit 5 has a 50/154 chance and unit 7 has a 10/154 chance of being 
selected. In other words, the sampling probabilities are equal 
over units of predicted volume rather than by the frequency of 
forest units as indicated by formula (1). Note that in neither 
case is it necessary to know the area of the forest units. When 
sampling is with replacement, the variance of the pps estimator v, 
over all possible samples of size n is (Cochran 1963, p. 253) 

Var(v) = ); P.(p--- V)^ (3) 

in which 

n is the sample size, 

N is the number of units in the population, 

t h 

is the probabi 1 ity of selecting the i^^ unit at 
the draw, 

t h 

y. is the timber volume in the i“ unit, 

V is the total volume in the forest. 

Clearly, Var(v) is dependent on the squared deviations of the 
ratios V^./Pv from V, The higher the linear correlation between 
the V^.'s and the Pij's, the less variability there is among the 
ratios V^./P^. If the V^.'s and the P.'s exhibited perfect pro- 
portionality the variance of v would be zero because the values 
of V^/P^ would be the same for all population units and would 
equal V. Needless to say, this is never the case, for if it were, 
sampling vvould be unnecessary. 
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Using the values in Table A. 1 , the Var(v) can be calculated 
for a sample size of, say, three for both variable probability 
sampling and equal probability (simple random) sampling, respec- 
tively. Accordingly, 

Var(Vppj) = 1(1916) = 638 

Var{v^P = i{24.915) = 8305 

The sampling error (standard deviation) in each case is the square 
root of the variance. 

S.E.^pp^^ = /638 = 25.3 = 17% of V 
= /83TT5 = 91.1 = 61% of V 

An unbiased estimate of the variance may be computed from a. single 
sample using the formula (Cochran 1963, p. 254) 

var(v) = -nv^] ' (4) 

This is verified in Appendix B for two-stage sampling. 

T wo-St ag e Va r iable Probabil ity Sampling 

When the primary sample units are too large for measuring the 
total attribute on the ground, as when measuring the volume of a 
large number of trees, a second stage may be added to the design 
yielding a new estimator 
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V 


V 




V . 

„J_ 


Pi 



111 

I 


1=1 


— y 
0 = 1 



( 5 ) 


in which 

f h 

V-- is the measured volume in the j ^ subunit in 

' vJ 

the i*"^ primary unit, 

p. . is the conditional probability of drawing the 
subunit given the i^^ primary unit, 
p^. is the probability of drawing the i^^ primary 
unit, 

n^ is the number of observations drawn from the 
primary unit, and 

m is the number of primary units drawn from the 
population. 

The two-stage pps estimator is unbiased if the observations are 
drawn as indicated by the selection probabilities. The second 
stage selection probabilities -- conditional on the first stage 
unit having been drawn -- may be defined as 


N. 

1 


P..=C../ YC.. 
ij >1 IJ 


1 J 


J-1 


• th 


in which . is the criterion value assigned to the j subunit 


in the i^^ primary unit and N. is the number of secondary sample 

’’ N • 

th J . 

units in the population of the i primary unit. Then i P-p “ ^ 

i = l 'd 

t h J 

for each i primary uni t. If J:he sum of the second stage predic- 
tions are adjusted such that T C.. is equal to C. from the 

j=l T ■ : ^ : ■ 
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first stage selections, the Lwo-stage estimator may be simplified 
to 


V 


C 

in’ 


111 


i=l 


n'. 

1 


1 V. . 

I 

j=^l ij 


(6) 


M 

in which C = C^. where M is the total number of primary 

i = l 

units in the target population. 

As derived in Appendix B, the variance of v over all 
possible samples of in primary units and n^. second stage units 
when sampling with replacement is 


Var(v) = ^ 


i=l ' i 
M V 


N ? 

M T 'i' Vt. . 

I- y __L._r y y^-i 

1=1 pi"i j=i ' 


,i Pi(pi-v)^ + i ■): 
1=1 ^ M i 


M 


^ij 


i=l Pi^i d=l ^'^/ij ^ 


(7) 


(7a) 


in which 

V. . is the timber volume (or other attribute) in 

‘ U ’ 

■f* In +* In 

the j' subunit of the i^' primary unit, 

V^. is the volume in the i^*^ population unit, and 
V is the total volume of timber in the population. 
The other terms are as defined earlier. 

As shown in Appendix B, an unbiased estimate of Var(v) 
for two- (or more) stage variable probability sampling may be cal- 
culated from the first stage estimates as with one-stage pps 
sampl ing (formula 4) . 
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Three-^Staqe Variable Probability Sampling 

It often happens in forest sampling problems. that even second 
stage sample units are too large for complete enumeration on the 
ground. This situation may arise when (1) very high altitude or 
space photography are used at the first stage, (2) the second 
stage units consist of long strips of large scale photography 
extending a mile or more, or (3) precise tree volumes are obtained 
on a subsample of trees in a plot by means of an optical dendro- 
meter. In these cases, three stages or more may be included in 
the sample design. 

The three-stage variable probability sampling estimator is 
simply an extension of the two-stage estimator (formula 5). It 
may be written as 



m 


m 1 

V _L 

p.n 

1=1 ^1 1 j = 


t . . 

MJ V 


I 


=1 k=;i P 


ijk 
i jk 



in which 


+“ h 

v.jj^ is the measured volume in the k, ^ third stage 
unit in the second stage unit in the i^!^ 


primary unit, 


P-i-,- 1 / selection probability associated with' 

1 J K 

V.V,;, defined as c. . I /c- -, 

ijk’ Tjk' ij ’ 

t| . is the number of third stage units in the ij^*^ 
first and second stage units, and the other 
terms are as defined earlier. 


when the predicted values are adjusted so that their sum 
equal s the values at the next higher stage ,i,e. , when 
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.13 


)■■ C . . . = C . . 

ijk ij 


k=l 


y c. . = c. 

IJ 1 


3=1 


>; C. = c 

i=l , 


the three-stage estimator may be simplified to 


P m I i 

V = ^ y ^ 

n • , . . 

' J 


^13 V 


ijk. 


V J_ V I y _'_J 
L rj L ¥ L p 

i=l ^ j=l ^ij k=l “^ii 


3k 


(9) 


If n. and t.. always equal one, formula (9) further simplifies 
^ • ’ J 

to 



111 

i 

i=l 



( 10 ) 


The reasonabTeness Of this final simplification would depend, of 
course, on the outcome of optimum allocation. ‘ From early trials , 
it appears that large scale aerial photographs contain' sufficient 
information to control the second or third stage variation within 
comparatively narrow limits compared to the first stage variance. 
One particular calculation of optimal allocation in a small two- 
stage triaT survey in Shasta County, California showed that n| 
should average about 1.6 for each first stage unit drawn. It may 
turn out that Improved photo interpretation techniques for large 
and small scale color photography will resul t in optimum n^. 's 
of 1. However, optimum allocation depends on cost in addition 
to the variance (see Appendix C). 



The variance of v for three-stage variable probability 
sampling with replacement is 


1 M 2 

I r r 1 KiC.- 


Var(v) = .pi.v^] +I ^ 


i=l ' i 
M 


M , ^i V^. p 

'll ir-i-V?] 


I 


i=l *^i i j=l ij 


N 


1 


T. . 2 

y — .1.,... y __J r P' ] 

i=rTi"i k=i Tijk ’j 

, M V. „ , M , V.. . 

i I Vp^-'')^ + i I rV I 

i = l ' i=l '^l"i j=l . ^ 


M 


+ -L y _L_ y ._.J y^p (Uil_ V )' 

i=l ^i'^ j=l '^ij^ij k=l 


1 


T. . 

13 


i jk 


(Tl ) 


( 12 ) 


in which all the terms are analogous to the first and second stage 
estimators defined earlier. The derivation of this variance is 
given in Appendix B. 


Stratified Sampling 

In situations where the forest is partitioned into strata, con- 
sisting of forest stands delineated on high altitude aerial or space 
photographs, stratified sampl ing would be employed before the first 
stage. The estimator for totaT forest volume using stratified 
three-stage variable probability sampling is simply the sum of the _ 
i ridividuaT stratum estimates , symbol i cal ly described as 



H 




H 

= I: 

h=l 


m,. 


'hi 

I 


'hij V 

i 


hijk 


'^^h i = l ^hi’^hi j=l '^hij^hij k=l '^hijk 
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(13) 


in which all the terms are as before except for h and H denoting 
the individual stratum and number of strata respectively. 


The Gain from Sampling with Variable Probabilities 

According to Murthy (1967, p. 197), the variance of v for 
one-stage simple random sampling with replacement may be estimated 
from a pps sample from the formula 


Var(v) 


srs 


-(N:I 


n y. 


i=vPi 


nY^ ) + -- Var(v) 
pps^ n ■ ^pps 


If we Substitute the estimated variance from pps sampling in the 
second term and also subtract it out and simpl ify, we obtain the 
estimated gain of pps sampling over equal probabi 1 ity sampl ing. 
The result may be expressed as . . 


n y 


2 


n 1 = 1 ^1 


(u: 


As it can be seen from the formula, the gain decreases sub- 
stantially as n gets large. Hence it would appear that the 
greatest gains are possible with snialT sample sizes and more parti' 
cularly when N is Targe. Therefore, under certain conditions 
pps sampling may provide better estimators than regression or 
ratio sampling when the sample size is small. T - ^ 


Transformations on Supplementary Variables ' 

As it is shown earlier, the pps estimator is unbiased as long 
as the observations are drawn as prescribed by the predetermined 
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selection probabilities. 

Even though the es.tiniator is unbiased, however, the most pre- 
cise results are obtained when the scatter is tight and the rela- 
tionship between the prediction and measurement variables is linear 
through the origin (proportional). It often happens in forest 
photointerpretation work that measurement variables, such as timber 
volume, are curvil inearly related to prediction variables such as 
tree crown diameter. Curvil inearity of this kind would certainly 
•lead to a higher variance. By subjecting the prediction variable 
to a suitable transformation, the relationship can be made nearly 
linear through the origin yielding significantly better results. 

For example, Aldred (1971) reports the following model can be used 
to predict white spruce volume in Eastern Canada from large scale 
air photographs. 

V = a + b(H log(CA) ) 

in which a and b are regression coefficients, H 'is photo- 
measured tree height, CA is photo-measured crown area' and V 
is the predicted volume of the tree in cubic feet. Similar func- 
tions are widely used to predict stand volumes from aerial photo- 
graphs which in turn can be used as selection probabilities in 
variable probability sampling. Translating the X scale so as to 
remove the X intercept can improve the proportional ity betv/een 
the X and Y variables and lower the variance in pps sampling. 
Some examples, taken from the Apollo 9 forest inventory experiment, 
are given in the following pages. 
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Figures A1 through A2 show the relationships between the timber 
volume per primary sample unit as estimated from subsampling and 
the volume per psu as determined from the space photos by means of 
photointerpretation. Figures Aland A2 show these relationships 
for the two strata in the state of Georgia. Figure A3 shows the 
same data as Figure A2 after translating the X scale to the right 
by an amount equal to the estimated X intercept from linear regres- 
sion. After making this translation, the estimated rel-variance 
of the ratios v^./p^- for that stratum is substantially reduced as 
shown in Table A. 2. While the sample size here is too small to 
draw any general conclusions, it is conceivable that in situations 
where the regression line between the Y and the X variates does 
not pass through the origin, a translation of the X scale to make 
it so can improve the overall proportional ity of the two variates 
and hence improve the pps estimator. 

Table A.2 

Rel-Variances of v^./p^. from the First Stage 
of the Apollo 9 Inventory Study . 




Georgia 

Area 

Mississippi 
Valley Area 

Unstratified 


.4409 

.2125 

Stratified (2 

strata) 

,0507 

..-0424::::: 

Stratified and 

translated 

.0033 

.0563 


(X-axis ) 

Figures :A4 through A6 show the felatirinships between the Y 
and X vartates for the two strata in the Mississippi River Val ley. 
Figure A6 shows the regression through the origin in; the bottomland 
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Volume Per PSD As Estimated From Subsampling 



Volume Per PSU As Determined From Space Photographs 


Figure A1 * Apoll o 9 Timber Inventory Study , Western Georg i a . 

Estimated volume vs. photo volume at first stage. 
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Y 



Fi gure A2 . AppT To 9 T i niber Inv entory S tudy , At1 a n ta Area . 
Estimated vs. photo volume at first stage. 
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Volume Per PSU As Estimated From Subsampling 



Figure A3. Apollo 9 Timber Inventory Study, Atlanta Area. 

Estimated vs. photo volume after translation 
of X scalp. • 
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Timber Volume Per PSU As Determined Frorii Space Photoaraphs 


Fi gure A4 • Apollo 9 Timber Inventory Study , Mi s s i s s i pp i 
Valley Pine Stratum. Estimated vs. photo 
volume at first stage. 





Figure A5. Apollo 9 Timber Inventory Study, Mississippi 

Valley Bottomland Hardwood Stratuiii, Estimated 
' vs .photo volume at first stage. 



Volume Per PSU As Estimated From Subsampling 



Timber Volume Per PSU As Determined From Space Photograplvs 


Figure A6. Apollo 9 Timber Inventory Study, Mississippi 

Valley Bottomland Hardwood Stratum. Estimated 
vs. photo volume after translation of X scale. 
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hardwood stratum after translating the X scale to the left. 
In this case, however, no gain in the rel-variance can be 
shown after the translation (Table A. 2). On the other hand, 
no significant harm appears to be done by the translation. 



APPENDIX B 
OPTIMUM ALLOCATION 

In order to take maximum advantage of th.e funds expefioed for 
a timber survey using aerial photographs, it is helpful to have 
some idea about the relative sample size required in each stage 
of the design. Optimum allocation may be approached from two 
directions: (1) to minimize variance for a specified cost or 

(2) to minimize cost for a specified variance. The former 
approach is pursued herein. 

Murthy (1967, p. 335) discusses optimum al location in two- 
and three-stage designs of various configurations of sample selec- 
tion. In his derivation of the optimal number of sample units to 
use in each stage, Murthy uses Lagrange multipliers, which is the 
method used here. However, this author presents the derivation for 
the cases where pps sampling is used in all stages of two- and 
three-stage designs. In addition, an approximate procedure is pre- 
sented here for a variable alTocation of samples in three-stage 
sampling which depends on the size of the units drawn in the first 
and second stages. 


Two-Stage Sampling 

Drawing from formula (7a) , we can write the variance of the 
two-stage pps estimator as , , ■ 


Var{v) = ~ + i n. p. 


1 ^2i 


-.2 
1 


0 


__L + J_'- J 
m m n^ 


(15) 


1 : « -21 
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in which 


°2i 


= o‘,/P. 


In introducing the cost constraint, we assume the simplest 
linear function 


m 

D = iiiD, + ( n. )D, 

‘ i^l ^ 

in which 

D is the cost of the survey excluding overhead costs, 

D-j is the average cost of measuring a first stage unit and 
includes the cost of enumerating the predictions (i.e., 
the cost of obtaining the C|'s), 

D 2 is the average cost of measuring a second stage unit, 
including travel costs. 

Then.,. ^ ; 


M - 

D* = E(D) = niD^ + m( i^P.n. )D 2 

= mD^. + mhD^ • .( 16 ) 


■ ■ M . ' 

in v;hich h = J P -ru , the expected average number of second 
i'=i. . - 

stage units per first stage unit. Now let N - Mh. Then by 

extracting from (15) and forming the Lagrange mul tipi ier , define 



2i 


i=l 




M 

+ X(M I P.n. - N) 
i = lVv 


Differentiating with respect to a particular n^ , and equating to 
zero, we have for the r^ first staQ^ unit 



from which 


c) L _ 


3n. 


"°2r 

+ AMP 
P 

r r 


= 0 


and finally, 


1 ^2r 


/A P /M 
r 


M 

I Pi Hi 

i = l ^ ^ 


M 

i = l 

✓X AA 


±_ 

Ax 


h >4^ 
, M 


(17 


Then substituting for T//A in equation (17) and simplifying, 
we obtain 


n,. - n 


'2r 


M 

Pr I ^2i 

i=l 


Hence 


? 4 


0 


i=l P,n^ 


M- 

M ol. 

T J±. 1 L! 

1=1 '’i ™2i 




Consequently, after "optimal '' al locatiQn of secondary units . 


Var(v) = 


M 2 
in mh 


2 2 

°1 ^2 

-J_ + 

in mn 
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Now minimizing Var(v) with respect to m and h subject 
to the expected cost constraint of formula (16), we define 


2 2 

^1 - -* 

L - + + x(ntD, + mnD^ - D ) 

m nm I ^ 


which yields 


or 


Then 


9-L 

in 


--A- + AmDo = 0 
mn 


mh 


= A(D*-mD^) 


mn = 


X(D*-mD|) 


(18) 


Ik 

0m 




m m n 


S i n c e f r 0 n 1 e q u a t i 0 n ( 1 6 ) 


we have 


D, + nDj - - 


_* 


2 2 

XD = m[— 2 ~ + 

ni til h 

- 2^ 2 
no-i + 02 

: mh 


(19) 


Substituting for mh in equation (19) from equation (18), and 
simplifying, we obtain 


-2^ 2 
na-j + 02 


:D „ 

D* - inD 


mD^ + mnD2 


mnD2 


TOl 



Hence 


B 4 ) 


nD, 


Thus 





D’ 


D-j + n^D2 


and the optimum allocation is given by 




(19 a) 


or 


0 _ ^2r rl 

V - 


As it can be seen above, the evaluations of m°, h°, and 
depend on a^, and Since these would not ordinarily 

be known ahead of time, estimates would have to be obtained from 
similar previous surveys or from a pilot survey. The estimating 
equations would be 


r V . 2 
rj- 


®2r " ^nj,(n^-l 


1/2 




and 


-2 _ 1 ^2i 

^ 2 m p. 
1=1 '^1 


S? = n,(var(v)) -i J ^ 1^1 
’ ) 1 = 1 "1 
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9 

where var(v) is obtained from equation (4) and the ol^'s are 
obtained from equation (20). • i. 

Three^Staqe Sampl i ng 

From equation (12) we obtain the variance of the three-stage 
pps estimator 


Var(v) = 


-1 + i y 

m m 6 


2 N. 2 

y - _L Y 

m n. P. m n. -S t.. P.P.. 
1=1 1 1 1=1 1 j=l • ij 1 ij 


'1 , 1 


M a 


+ - y 

m in n^ 


2i +1 


M 

I 


N. ^2 

i ^ 


m n. tv. 
1=1 1 j=i ij 


( 21 ) 


2 

in which 02 ^. 


21 


3ij 


2 ^ 

“ p~p ~ > m and n . are as defined 
i ij 


• I'u th 

above, and is the number of third stage units in the ' j 

second stage unit within the i^^ first stage unit. We then intro 
duce the simple linear cost constraint 


m 


m *^i 


D - iiiD 


1 + (,I ",)d2 + ( I I gilD, 

1=1'^^ i=l j = l ’J 


in which D.j, D 2 , and are the costs attributable to measur- 
ing the first, second and third stage units fespectively; 



The expected cost is 


D = E(D) = niD^ + mnD 2 + mhtD^ 


We seek first the optimal values of t.. and t in the third 
stage and then progress to the second stage values of n- and h. 
Let = N^t be the total number of third stage units in the i^^ 
first stage unit. Then 


N. ~2 
^ ° 3 ij 


M 

M 1 


L = I + a(n I I P P t. . -T.: 

j=i 1 1 ij ij 1 


Differentiating with respect to a particular t^.j, we have 


8 L 

9 t. 

vr 




= 0 


tt 


ir 


and 


^2 

3 ir 


’ 3 ir 


ir 


XNiPiPi^ 


7 7 

ANiP^P^r 


Hence 


^ir 


_ 1 3 ir 


v/A P.P. /nT 
1 rr 1 


M N. 

M 1 


t = 


i=l j=T ■ /X i=l j=l /N, 


. N. 

- I i 


and 


1 _ 


M a . 

l l ^ 

i=i j=i 
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^ h 

Hence the optimum number of third stage units in the r^^ second 
stage unit is given by 


^3ir 


M ^i 0^.. P.P. /N. 


i=T j=l /ny' 


Thus in the third term of the-variance, we have 


I I r L -7H] 

0 = 1 0=1 '’Tijt‘’3io i-l /N^ 


M N. N. 

V M 1 0, . . 1 

I r v r ilj' 


= |-( I I -^) I OsupT" 

t i=l 0=1 /N. 0 = 1 

A, 


(22) 


Thus 


Var(v) = 


0 

m 


1 + 1 y + J 

^ ni n, „i't -i: 


i=l ”t 


M A. 

^ y -i 

^ 1=1 ^i 


2 ~2 ^i 

Ov , M 05. +-r^ 

= J- 4- 1 y _^l__t_ 

m n. 


Working with the second stage units, let N = Mh. Then we 
wish to minimize the total variance, 


L = 


M 0 ^ +-f M 
I. x(M I.P,n,-N) 


r=i 


r=l 


with respect to hj„. Accordihgly, we have 



IL 


“2 — 
n„ 


+ AMP = 0 


Hence 


2 ~2 

AMP = ai + -- 
r r 2r i 


. A P 

xJ:' r 
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Solving for n^ we obtain 


, AP 1/2 
r 2 r r^ 

_ 1 hr"~r~) 

/X p M 
r 


(24) 


which then yields 


M . A.P. 1/2 

M , Ir Ki 

R = lP,n. 

i=l ' ' M 


or 


_L 

A 


n /M 


M 


J/oy+A.P./t) 


nl/2 


(25) 


Then using equations (24) and (25) we can express the niiniiiiizing 


(optiinuin) value of n^ as: 


- foo^ + A P /t) 
0 n '' 2T r r' ■' 


?n1/2 


"^r " p M . 

r V- ( 2 


,no^i^A,p^/t) 


1/2 


(26) 


By substituting equation (26) into equation (23) and simplifying, 
we obtain 
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Var(v) = 




1/2- 


i=l 


in 


nint 


The above solution assumes that no serious "stability" problems 
arise from having t within the radical. However, should this be 
a problem, the sample size can be held constant at the second 
stage so as to eliminate this problem. In the class of estimators 
discussed in this paper, a three-stage design would entail space 
and aerial photo interpretation in the first and second stages. 

The field data would be obtained in the third stage only. Since 
it is the field samples which usually are niore costly than photo 
samples in forest surveys, it is important to have a near optimal 
allocation of field plots to photo plots, taking advantage of the 
variable sizes of the photo units if possible. On the other hand, 
the aHocation of second stage photo plots to first stage photo 
plots that accounts for the exact variations in the size of the 
primary units may not be of extreme importance. Therefore, to keep 
the complexity of the problem within reasonable bounds, we will 
hold the second stage sample size (n) constant and proceed from 
there. In this simplified case, we express the variance as 


Var(v) = + 


mn 


: M 

I 

i^l 


'2i 


mn 


a-, 


m 


): 


1 


M 

t. 

T=l J=1 1J 


_!3iL 
p."p. . 
T TJ 


+ — y a,. + — y 

ran 2i mn 


. ..2 
1 o^ . . 
V jy 

^ t-. . 

u 


;27) 


The accompanying cost function is 
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Ill n 

D = inD, + iiinD^ + ( y y t. .]Dt 


wi th 


M ^i 


= I ypp..t.. = t 

i=l j=i 1 'J 1J 


and 


D* = E{D) = niD^ + mnD2 + nintD^ 


As before, let = N^t. Then allocating ''optimal ly''i gives 


t° 

ir 


Sir 


N'. 


I 'i 

i =1 j=l /N . 


(28) 


and from formula (27) 


N . _2 

3=1 ij 


1 I” i ®3ii ' 

i( r I ^1 l 

^ i=l j = l /n7 j=l ^ 


A. 


as before. Hence 


Var(y) = 


M p . M : ■ 


!i + _i 

i=r2i Hint 


Ill mn Hint 


(29) 


Minimizing Var(v) with respect to in, n (constant) and t 
subject to the cost constraint gives 


f° := 


/EnD. 


(30) 
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( 32 ) 



/Eyo^ + *^^ 2^2 ^^3^3 


These results, although arrive(j at differently, agree with Murthy 
(1967, p. 347). The variable allocation to third stage units 
would be according to equation (28) and an appr.oximation of the 
variable allocation of second stage units to first stage units can 
be made according to equation (26). However, these two equations 
require population values that are interconnected through all 
stages of the design and would not normally be available. There- 
fore, approximate estimators for these values are given below. 

In order to use a variable allocation which depends on unit 
sizes, we shall need estimators for the optimum values of n^ and 

t|^. Incorporated within these are the requirements for estimators 

2 

of the A^'s, the 02 .j^'s and the C 2 ^'s. Accordingly, we have 
the estimates 


and 


"r P 




1 m -1 ry 

r 1 V I 
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tv = 

IT 


°3ir 
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1 y _L_ ^ V /^3ijy PiPjr^ 
m P^n. p..( 
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n . '' nr 

■> 1 1 CTot ■!. T , 

a //N 

i'=l jil Pij /NT' "r jSl Prj 3>-0 r 
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The third stage variances in the above fontiula can be esti- 
mated from a pilot sample or previous survey using the same form 

of estimator given in equations (4) and (20).. To obtain estimates 
2 

of the 0 ' 2 ^'s, consider the second stage samples and their respec- 
tive third stage subsamples as m two-stage samples. Then 


Hr '^r^''ar(v)^} 


"r jh S 


in which var(v)^ is the estimated total sample variance in the 
r^*^ first-stage unit. 


Finally, to estimate 
2 ^ 2 

needed for a-, , J . , 


follows: 


m°, n°, and t° estimates are first 
M 

and ^ A-. These may be obtained as 
' ■ i-1 . 






APPENDIX C 


DERIVATION OF THE VARIANCES 

Two-Stage Variable Probability Sampling with Replacement 
From the basic definition of variance, we have 

Var(v) = E[v - E(v)]^ 

= E[v - V]^ 

= E[v^] - V^ (33) 

By substituting the sample estimate of v (equation 2) Into (33), 
we obtain 


Var(v) = E[(i i 


■ni Pi 


m 


m 


m V. 2 

4. p. 


1 Ml V. C. n 


2 

m V . 


m 


EC I I 

i=l p^ ij^i 


Pi P,-. 


-] 



By expressing each of the v^.'s in terms of the second stage esti- 
mators , we have 


Var(v) = 


111 


£{E [ r I - 
p\n\ j=l P 


"i V.. 2 

li) 


IJ 


in I /“i v.. 

y r_L_ y r 


I 


1 j 


l/l ^11 j = l ' 1 ' j = l ^v' j 


- V' 


in which E| is the conditionaT expectation of second stage ele- 
ments holding the first stage sample fixed, Before evaluating the 


in 



expectations in the second stage, however, we must first decompose 
V..2 

( 1 “■) ^"^to its sums of squares and cross products. Hence 

j=1 *^ij 


1 III -1 I V ♦ • III -I 

Var(v) = I I ^ I ^ 

m ^ i=l p^n^ Pij Pi"i 


m 


1 "v* "ij 


m 


*^1 V . , V. 


m 


"i V„, 


^ I I 


' i'^)]}-v2 


i^^i ' Pi'^i j = l Pi j *^i *^i ' j=l Pi ' j 


(34) 


T m , V.. m n.-l 

= iE[ I E. 


Ill i=l p.n. p 
1 


V . . V . . , 

y E . J!A _I1'_ 

’Pi: P1J' 


m T V . . 

+ r,(rEi r'-)( 

P, 1 P^j 


V,, 


^ Jj-) E. , pr^)] - V‘ 
i^i ' Pi Pi j Pi ' Pi ' j 


The conditional expectations of the cross products in the second 
term within the brackets may be evaluated separately because sub- 
sampling within clusters is with replacement. The conditional 
expectations may be taken separately in the third term because sub- 
saiiipl ing wi thin one cluster is independent of the subsanipling within 
all other clusters. Hence, 


, m , V?, rrn.-l N, 

VarCv) = J-E[ I “2^ I t l 'Jfy') 

m i=l p^n* j=l' ij i=l p^n. j=l j'=l 


m ^ 'i 


N,. 


+ I I V I V. .)] 

Pi i=l Pi ■ i'=l ^ J 


-r 


T ni n ^i V?. ni n.-l - m V. V., . - 

): ^ i r 1 -|-v + I pi pi-] - 
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. u. , V^. Ill V? 
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I ? L p ' L 0 ~ L 0 1. p p 
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How, evaluating the unconditional expectation over all population 
clusters, we have 


Var(v) = -\[mE 


1 


v?. 

)' niE 


2 P. . 

p^n. j=l ij 


Pi 


''r 

niE -p — + iii(ni-l )E j,— ] 

Pin. 


'l''! 

p. p. 
11 


V 


Remembering that the expectations of the cross product in the third 
term are evaluated separately because of independence due to select- 
ing the first stage clusters with replacement, we have 


, M ' ^i V?. m V? M V? 

^ ^ I ^ I :^ - m I ^ 


Var(v) = ^[m [ p ^ l p 
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III 4 D 


2 

M Vf 


ij\, j 'i 1 V -1 


i=rPi^ j=l Pij '^i=^ ^ 


+ V 


Yi - v2 

m 


Finally, by rearranging terms and factoring,: we obtain Var(v) 
for two stage sampling with replacement, i.e. 


■ M vi „2 , M 


1 


111 P .n . P . . 
1=1 1 T j=l ij 


'*i V^. , M V? 

Y 'll 1 Y ^ 
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1 = 1 11 
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if Y _L_v^l + i Y — y 

p, '' y ^ m i', P,n,\iv p, . 

1=1 1 1=1 1 1 j=] 1 j 

1^^ stage 2^^ stage 
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Three-Stage Variable Probability Sampling with Replacement 

In deriving Var(v) for three stage probability sampling, 
we return to equation (34), substitute the third stage estimators 
in place of the v..'s, and proceed as before: 

. * J 


I f r- rv r V 1 r ! r 1 J K- 


Var (V) :{E [E [ ); ^ T ^Vf 5^") 
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Pi‘"f' j=l Pi'j^i'j k=l Pi'jk 


In evaTuating the expectation, we first find the conditional 
expectation of the third stage, holding the units of the first two 
stages constant. SecondTy , we find the condi ti onal expectation 
over all second stage units, holding the first stage units con- 
stant.' Finally, we evaluate the uhconditionaT expectation over 
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all first stage units. During the process, we keep in mind that 
all cross product terms contain independent random variables due 
to sampling with replacement at all levels and independently 
selected subsamples. In evaluating the expectations , vie have 
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1 1 I t V . . . 
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Finally, by rearranging terms and factoring, we have 
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Derivation of the Estimate of the Variance 

It was stated earlier (in Appendix A) that regardless of 
the number of stages in variable probability sampling, an unbiased 
estimate of Var(v) can be obtained from the estimates at the 
first stage. The formula may be written as , 

2 

/\ 1 ni V. 2 

-2-'"'' t (35) 

^ ^i 

in which, 


V is the estimated population total obtained by 
pps sampling, 

■f* K 

v^ is the estimated total of the i^^ first stage unit, 

t h 

p^ is the probability of selecting the i^^ first stage 
unit, and 

m is the number of primary units in the sample. . 


The proof of equation (35) is given below for two-stage variable 
probability sampling to illustrate the principle, i.e. 
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Substituting the second stage estimates for the v/'s, we have 
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We now take the conditional expectation of the second stage sample 
holding the first stage sample fixed. 
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The 
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unconditional expectation over all first stage samples is then 
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